
User Query #1
>tr|B6V6A2|B6V6A2_APIME Silk fibroin 3 OS=Apis mellifera GN=Fibroin3 PE=2 SV=1
MQIPTFVAIC LLTSGLVHAG VEEFKSSATE EVISKNLEVD LLKNVDTSAK RRENGAPVLG KNTLQSLEKI KTSASVNAKA AAVVKASALA LAEAYLRASA LSAAASAKAA AALKNAQQAQ LNAQEKSLAA LKAQSEEEAA SARANAATAA TQSALERAQA SSRLATVAQN VASDLQKRTS TKAAAEAAAT LRQLQDAERT KWSANAALEV SAAAAAAETK TTASSEAANA AAKKAAAIAS DADGAERSAS TEAQSAAKIE SVAAAEGSAN SASEDSRAAQ LEASTAARAN VAAAVGDGAI IGLGEEAGAA AQLLAQAKAL AEVSSKSENI EDKKF

# fasta35.exe -q -H -B -m 9i -w 80 -E 1.0E-7 -d 20 C:\Windows\Temp\allE8BF.tmp fasta/version15.fasta
FASTA searches a protein or DNA sequence data bank
 version 35.04 Jan. 15, 2009
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: C:\Windows\Temp\allE8BF.tmp
  1>>>tr|B6V6A2|B6V6A2_APIME Silk fibroin 3 OS=Apis mellifera GN=Fibroin3 - 335 aa
Library: fasta/version15.fasta  443551 residues in  1897 sequences

 443551 residues in  1897 sequences
Statistics:  Expectation_n fit: rho(ln(x))= 5.2822+/-0.00476; mu= 7.6377+/- 0.243
 mean_var=103.5980+/-26.344, 0's: 1 Z-trim: 3  B-trim: 251 in 2/42
 Lambda= 0.126008
Algorithm: FASTA (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (15:-5) ktup: 2
 join: 37, opt: 25, open/ext: -10/-2, width:  16
 Scan time:  0.000
!! No sequences with E() < 0.000000


335 residues in 1 query   sequences
443551 residues in 1897 library sequences
 Scomplib [35.04]
 start: Wed Sep 16 15:21:18 2015 done: Wed Sep 16 15:21:18 2015
 Total Scan time:  0.000 Total Display time:  0.000

Function used was FASTA [version 35.04 Jan. 15, 2009]

User Query #1
>MaSP1 15-mer
MHHHHHHHHK LAVGGAGQGG YGGLGSQGAG RGGLGGQGAG AAAAAAVGGA GQGGYGGLGS QGAGRGGLGG QGAGAAAAAA VGGAGQGGYG GLGSQGAGRG GLGGQGAGAA AAAAVGGAGQ GGYGGLGSQG AGRGGLGGQG AGAAAAAAVG GAGQGGYGGL GSQGAGRGGL GGQGAGAAAA AAVGGAGQGG YGGLGSQGAG RGGLGGQGAG AAAAAAVGGA GQGGYGGLGS QGAGRGGLGG QGAGAAAAAA VGGAGQGGYG GLGSQGAGRG GLGGQGAGAA AAAAVGGAGQ GGYGGLGSQG AGRGGLGGQG AGAAAAAAVG GAGQGGYGGL GSQGAGRGGL GGQGAGAAAA AAVGGAGQGG YGGLGSQGAG RGGLGGQGAG AAAAAAVGGA GQGGYGGLGS QGAGRGGLGG QGAGAAAAAA VGGAGQGGYG GLGSQGAGRG GLGGQGAGAA AAAAVGGAGQ GGYGGLGSQG AGRGGLGGQG AGAAAAAAVG GAGQGGYGGL GSQGAGRGGL GGQGAGAAAA AAVGS

# fasta35.exe -q -H -B -m 9i -w 80 -E 1.0E-7 -d 20 C:\Windows\Temp\allA533.tmp fasta/version15.fasta
FASTA searches a protein or DNA sequence data bank
 version 35.04 Jan. 15, 2009
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: C:\Windows\Temp\allA533.tmp
  1>>>MaSP1 15-mer - 525 aa
Library: fasta/version15.fasta  443551 residues in  1897 sequences

 443551 residues in  1897 sequences
Statistics:  Expectation_n fit: rho(ln(x))= 5.1846+/-0.00535; mu= 0.6218+/- 0.274
 mean_var=153.7491+/-41.502, 0's: 1 Z-trim: 27  B-trim: 74 in 1/42
 Lambda= 0.103435
Algorithm: FASTA (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (15:-5) ktup: 2
 join: 37, opt: 25, open/ext: -10/-2, width:  16
 Scan time:  0.000

The best scores are:                                           opt z-sc E(1897) %_id  %_sim  alen
gi|27806257|gid|1565|collagen alpha-2(I) chain precurso (1364)  524 441.9 1.6e-019 0.330 0.554  558
gi|288860106|gid|160|high molecular weight glutenin sub ( 848)  520 440.7 1.8e-019 0.305 0.434  518
gi|736319|gid|160|glutenin [Triticum aestivum]          ( 838)  496 421.4 2.2e-018 0.307 0.431  534
gi|71084277|gid|160|HMW glutenin x-type subunit Bx7 pre ( 795)  448 382.9 3.1e-016 0.284 0.455  567
gi|21751|gid|160|high molecular weight glutenin subunit ( 648)  405 349.1 2.3e-014 0.293 0.424  512
gi|21779|gid|160|HMW glutenin-like protein product [Tri ( 660)  400 344.9  4e-014 0.289 0.420  505
gi|22090|gid|160|HMW glutenin subunit 1By9 [Triticum ae ( 705)  398 343.1 5.1e-014 0.302 0.438  527
gi|21743|gid|160|high molecular weight glutenin subunit ( 830)  392 337.5  1e-013 0.283 0.432  523
gi|508732625|gid|160|high molecular weight glutenin sub ( 794)  379 327.2 3.8e-013 0.275 0.432  516
gi|170743|gid|160|HMW glutenin subunit Ax2* [Triticum a ( 815)  379 327.1 3.9e-013 0.275 0.432  516

>>>MaSP1, 525 aa vs fasta/version15.fasta library

>>gi|27806257|gid|1565|collagen alpha-2(I) chain precursor [Bos taurus]       (1364 aa)
 initn: 119 init1: 119 opt: 524  Z-score: 441.9  bits: 92.8 E(): 1.6e-019
Smith-Waterman score: 578; 33.0% identity (55.4% similar) in 558 aa overlap (12-525:528-1080)

                                            10         20         30        40
MaSP1                               MHHHHHHHHKLAVGGAG-QGGYGGLGSQG-AGRGGLGGQGAGAAAAAAVGG
                                               : :  : ::  :: : :: ::  :. :  . :..:. .:
gi|278 GNIGFPGPKGPSGDPGKAGEKGHAGLAGARGAPGPDGNNGAQGPPGLQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEAGK
       490       500       510       520       530       540       550       560

      50            60         70        80        90        100        110        120
MaSP1  AGQGG----YGGLGSQGA-GRGGLGGQGAGAAAAAAVGGAGQGGYGGL-GSQGAGRGGLGGQG-AGAAAAAAVGGA-GQG
        :. :    .:  :  :: :. :  :....:. .. .:. : .:  :  :..:   : .:. : :: .. ... :  : .
gi|278 PGERGIPGEFGLPGPAGARGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEP-GVVGAPGTAGPSGPSGLPGERGAA
       570       580       590       600       610       620        630       640

              130         140         150        160        170       180       190
MaSP1  GY-GGLGSQG-AG-RGGLGGQGAGAA--AAAAVGGAGQGGYGG-LGSQG-AGRGGLGGQGAGAAAAAAVGGAGQGGYGG-
       :  :: : .: .: :: .:. :  .:  : .:.:. : .: .:  :  : :: .: .:  .. .  . :: :: .:..:
gi|278 GIPGGKGEKGETGLRGDIGSPGRDGARGAPGAIGAPGPAGANGDRGEAGPAGPAGPAGPRGSPGERGEVGPAGPNGFAGP
        650       660       670       680       690       700       710       720

                  200       210       220       230       240         250       260
MaSP1  ---LGSQGA----GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGA--AAAAAVGGAGQGGYGGL-G
           :. ::    :  :  :... .. .. ::.:: .: .:  . ...::  :  :: .  .::. .:  : .: .:  :
gi|278 AGAAGQPGAKGERGTKGPKGENGPVGPTGPVGAAGPSGPNGPPGPAGSRGDGGPPGATGFPGAAGRTGPPGPSGISGPPG
        730       740       750       760       770       780       790       800

            270       280       290        300        310       320        330
MaSP1  SQG-AGRGGLGGQGAGAAAAAAVGGAGQGGYGGL-GSQG-AGRGGLGGQGAGAAAAAAVGGAGQGGY-GGLGSQG----A
         : ::. :: :  .  . ..  : .: .:  :. : .: .:. : .:  .  .  . .:. :  :  :. : .:    :
gi|278 PPGPAGKEGLRGPRGDQGPVGRSGETGASGPPGFVGEKGPSGEPGTAGPPGTPGPQGLLGAPGFLGLPGSRGERGLPGVA
        810       820       830       840       850       860       870       880

         340       350       360         370       380       390        400       410
MaSP1  GRGGLGGQGAGAAAAAAVGGAGQGGYGGL-GSQG-AGRGGLGGQGAGAAAAAAVGGAGQGGY-GGLGSQGAGRGGLGGQG
       :  :  :  . :.  .: :  :. :  :. :. : ::: :  :. .  .  .  :  :. :: :. :  ::. :. : ::
gi|278 GSVGEPGPLGIAGPPGARGPPGNVGNPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERGYPGNAGPVGAA-GAPGPQG
        890       900       910       920       930       940       950        960

            420        430         440       450       460        470        480
MaSP1  AGAAAAAAVGGAGQGG-YGGLGSQGA-G-RGGLGGQGAGAAAAAAVGGAGQGGYGGL-GSQG-AGRGGLGGQGAGAAAAA
         .. ..  :. :. :  :..:  :: : ::  : ::  .  .   :  :  :  :: : .:  :  ::.:. .  .: .
gi|278 P-VGPVGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEP-GDKGPRGLPGLKGHNGLQGLPGLAGHHGDQGAPG
          970       980       990      1000       1010      1020      1030      1040

       490       500       510       520
MaSP1  AVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGS
       ::: ::  : .:  :  ::. :  :: .... :.  ::
gi|278 AVGPAGPRGPAG-PSGPAGKDGRIGQPGAVGPAGIRGSQGSQGPAGPPGPPGPPGPPGPSGGGYEFGFDGDFYRADQPRS
          1050       1060      1070      1080      1090      1100      1110      1120

>>gi|288860106|gid|160|high molecular weight glutenin subunit 5 [Triticum ae  (848 aa)
 initn: 195 init1: 195 opt: 520  Z-score: 440.7  bits: 91.9 E(): 1.8e-019
Smith-Waterman score: 524; 30.5% identity (43.4% similar) in 518 aa overlap (15-509:198-687)

                                         10        20         30           40        50
MaSP1                            MHHHHHHHHKLAVGGAGQGGYGGLGSQ-GAGR---GGLGGQGAGAAAAAAVGGA
                                               : :: ::   .::   :.    . : ::   . :
gi|288 WQQPEQGQPRYYPTSPQQSGQLQQPAQGQQPGQGQQGQQPGQGQPGYYPTSSQLQPGQLQQPAQGQQGQQPGQAQQGQQP
       160       170       180       190       200       210       220       230

               60        70        80        90       100       110       120       130
MaSP1  GQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQG
       :::   : :.::   :   ::  : .  .   : :: ::   . : .:.:    : .   ..    : ::.::   . :
gi|288 GQGQQPGQGQQGQQPGQ--GQQPGQGQQGQQLGQGQQGYYPTSLQQSGQG----QPGYYPTSLQQLGQGQSGYYPTSPQQ
       240       250         260       270       280           290       300       310

              140       150       160         170         180       190       200
MaSP1  AGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGA--GRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAGRGGLGG
        :.:   ::    : .   : . ::   : :.::   :.:   :::  .   ..   .: :: ::   .::   ..   :
gi|288 PGQGQQPGQLQQPAQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSSQQPTQSQQPG
             320       330       340       350       360       370       380       390

        210       220       230       240         250       260       270       280
MaSP1  QGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAA
       ::            ::    :  .:  :.:   :::  .   ..   .: :: ::   . : .:.:   ::   .: .
gi|288 QGQ----------QGQQVGQGQQAQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYLTSPQQSGQGQQPGQLQQSAQGQK
                       400       410       420       430       440       450       460

          290       300       310       320       330       340           350
MaSP1  VGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQ----GAGAAAAAAVG----GA
           :::   : :.::  . : : ::          : :: ::   . : .:.:   ::    : :  .   ..    :
gi|288 GQQPGQGQQPGQGQQGQ-QPGQGQQGQQP-------GQGQPGYYPTSPQQSGQGQQPGQWQQPGQGQPGYYPTSPLQPGQ
             470        480              490       500       510       520       530

        360       370       380       390       400       410       420       430
MaSP1  GQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQG
       :: ::   . :  :.:    :  :     : :  ::    :  .:  ..   : . : .  .   : . ::   : :.::
gi|288 GQPGYDPTSPQQPGQG----QQPGQLQQPAQGQQGQQLAQGQQGQQPAQVQQGQRPAQGQQGQQPGQGQQGQQLGQGQQG
           540           550       560       570       580       590       600

          440       450       460       470         480       490       500        510
MaSP1  A--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQ-GAGRGGLG
          :.:  : : : .  .   : . ::   : :.::   :.:   :::      ..   .:::   :  .: : :. :
gi|288 QQPGQGQQGQQPAQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQGQPGYY
     610       620       630       640       650       660       670       680

             520
MaSP1  GQGAGAAAAAAVGS

gi|288 LTSPLQLGQGQQGYYPTSLQQPGQGQQPGQWQQSGQGQHWYYPTSPQLSGQGQRPGQWLQPGQGQQGYYPTSPQQPGQGQ
     690       700       710       720       730       740       750       760

>>gi|736319|gid|160|glutenin [Triticum aestivum]                              (838 aa)
 initn: 184 init1: 184 opt: 496  Z-score: 421.4  bits: 88.3 E(): 2.2e-018
Smith-Waterman score: 512; 30.7% identity (43.1% similar) in 534 aa overlap (18-513:229-736)

                                      10        20          30        40        50
MaSP1                         MHHHHHHHHKLAVGGAGQGGYGGLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGY
                                               ::   : :.::   :.:   :::  .   .     :::
gi|736 QGQQPGQGQQGRQPGQGQPGYYPTSSQLQPGQLQQPAQGQQGQQPGQGQQGQQPGQGQQPGQGQQGQQPGQGQQPGQGQQ
      190       200       210       220       230       240       250       260

             60          70        80          90       100       110       120
MaSP1  G---GLGSQGAGRGGL--GGQGAGAAAAAAVG--GAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGS
       :   : :.::    .:  .:::  .   ...   : ::.::   . :  :.:   ::    : .     . ::   : :.
gi|736 GQQLGQGQQGYYPTSLQQSGQGQPGYYPTSLQQLGQGQSGYYPTSPQQPGQGQQPGQLQQPAQGQQPEQGQQGQQPGQGQ
      270       280       290       300       310       320       330       340

      130         140         150       160       170       180       190       200
MaSP1  QGA--GRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL
       ::   :.:   :::  .   ..   .: :: ::   .::   ..   :::            ::    :  .:  :.:
gi|736 QGQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSSQQPTQSQQPGQGQ----------QGQQVGQGQQAQQPGQGQQ
      350       360       370       380       390                 400       410

           210        220       230       240       250       260       270       280
MaSP1  GGQGA-GAAAAAAV-GGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAA
        :::  :   .. . .: :: ::   . : .:.:   ::   .: .      :::   : :.::  . : : ::
gi|736 PGQGQPGYYPTSPLQSGQGQPGYYLTSPQQSGQGQQPGQLQQSAQGQKGQQPGQGQQPGQGQQGQ-QPGQGQQGQQP---
      420       430       440       450       460       470       480        490

            290       300           310       320           330       340       350
MaSP1  AAVGGAGQGGYGGLGSQGAGRGGLGGQ----GAGAAAAAAVG----GAGQGGYGGLGSQGAGRGGLGGQGAGAAAAA---
           : :: ::   . : .:.:   ::    : :  .   ..    : :: ::   . :  :.:   ::    : .
gi|736 ----GQGQPGYYPTSPQQSGQGQQPGQWQQPGQGQPGYYPTSPLQPGQGQPGYDPTSPQQPGQGQQPGQLQQPAQGQQGQ
              500       510       520       530       540       550       560       570

                      360       370        380       390       400       410       420
MaSP1  --AVGGAGQ-------GGYGGLGSQGAGRG-GLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAV
         : :  ::       :   . :.::   : :  ::  : .   : :  ::    :  .:  :.:   :::      ..
gi|736 QLAQGQQGQQPAQVQQGQQPAQGQQGQQLGQGQQGQQPGQGQQPAQGQQGQQPGQGQQGQQPGQGQQPGQGQPWYYPTSP
              580       590       600       610       620       630       640       650

             430       440        450        460       470       480       490
MaSP1  GGAGQGGYGGLGSQGAGRGGLGGQGA-GAAAAAAVG-GAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGG
         .:::   :  .:  :.    :::  :   .. .  : :: ::   . :  :.:   ::   .       : :: ::
gi|736 QESGQGQQPGQWQQ-PGQWQQPGQGQPGYYLTSPLQLGQGQQGYYPTSLQQPGQGQQPGQWQQS-------GQGQHGYYP
              660        670       680       690       700       710              720

     500       510       520
MaSP1  LGSQGAGRGGLGGQGAGAAAAAAVGS
        . : .:.:   ::
gi|736 TSPQLSGQGQRPGQWLQPGQGQQGYYPTSPQQSGQGQQLGQWLQPGQGQQGYYPTSLQQTGQGQQSGQGQQGYYSSYHVS
            730       740       750       760       770       780       790       800

>>gi|71084277|gid|160|HMW glutenin x-type subunit Bx7 precursor [Triticum ae  (795 aa)
 initn: 161 init1: 161 opt: 448  Z-score: 382.9  bits: 81.1 E(): 3.1e-016
Smith-Waterman score: 516; 28.4% identity (45.5% similar) in 567 aa overlap (15-520:195-754)

                                         10        20        30        40        50
MaSP1                            MHHHHHHHHKLAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGG
                                               : :: ::   . : .:.:   :::  .   ..   .::
gi|710 YPTSPQQPGQGQQLGQGQPGYYPTSQQPGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQ
          160       170       180       190       200       210       220       230

           60        70          80        90       100       110             120
MaSP1  YGGLGSQGAGRGGLGGQGAGAA--AAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAV--GGA----GQGGYGGL
         : :.:  :.:  .:::  .   . .   : :: ::  .. :  :.:  .:::  .   ...   :     :::   :
gi|710 QPGQGQQ-PGQGQQSGQGQQGQQPGQGQRPGQGQQGYYPISPQQPGQGQQSGQGQPGYYPTSLRQPGQWQQPGQGQQPGQ
          240        250       260       270       280       290       300       310

        130         140       150       160          170       180       190       200
MaSP1  GSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQG---AGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGR
       :.::   :.:  .:::  .   ...   :::   : :. :   ... .  ::  : .      : :: ::   . : .:.
gi|710 GQQGQQPGQGQQSGQGQQGYYPTSLQQPGQGQQLGQGQPGYYPTSQQSEQGQQPGQGKQP---GQGQQGYYPTSPQQSGQ
           320       330       340       350       360       370          380       390

             210       220       230               240       250       260       270
MaSP1  GGLGGQGAGAAAAAAVGGAGQGGYGGLGSQG--------AGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLG
       :   :::  .   ..   .:::  .: :.::        .:.:   ::: ..   ..   .:::   : :.: .:.:  :
gi|710 GQQLGQGQPGYYPTSPQQSGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQSGYFPTSRQQSGQGQQPGQGQQ-SGQGQQG
              400       410       420       430       440       450       460

                            280       290       300       310       320       330
MaSP1  ---GQGAGA--------------AAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAG
          :::  :              :.     : :: ::   . :  :.   .::.  ..    :    .    :  .: :
gi|710 QQPGQGQQAYYPTSSQQSRQRQQAGQWQRPGQGQPGYYPTSPQQPGQEQQSGQAQQSGQWQLVYYPTSPQQPGQLQQPA-
     470       480       490       500       510       520       530       540

        340       350       360            370       380         390       400
MaSP1  RGGLGGQGAGAAAAAAVGGAGQGG-----YGGLGSQGAGRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAGRGGLG
       .:   .::  .:     : : :.:     :   . :  :.    .::  .   ..   .: :: ::   . : .:.:  :
gi|710 QGQQPAQGQQSAQEQQPGQAQQSGQWQLVYYPTSPQQPGQLQQPAQGQQGYYPTSPQQSGQGQQGYYPTSPQQSGQGQQG
      550       560       570       580       590       600       610       620

          410       420          430       440       450       460       470       480
MaSP1  -----GQGAGAAAAAAVGGA---GQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGA
             : .: .   . :     :: ::  .. : .:.:   :::  .   ..   .:::   :   :  :.    :::
gi|710 YYPTSPQQSGQGQQPGQGQQPRQGQQGYYPISPQQSGQGQQPGQGQQGYYPTSPQQSGQGQQPG-HEQQPGQWLQPGQGQ
      630       640       650       660       670       680       690        700

             490       500               510       520
MaSP1  GAAAAAAVGGAGQGGYGGLGSQG--------AGRGGLGGQGAGAAAAAAVGS
        .   ..   .:::  .: :.::         :.:   :::  . :.
gi|710 QGYYPTSSQQSGQGHQSGQGQQGYYPTSLWQPGQGQQPGQGQQGYASPYHVSAEYQAARLKVAKAQQLAAQLPAMCRLEG
       710       720       730       740       750       760       770       780

gi|710 SDALSTRQ
       790

>>gi|21751|gid|160|high molecular weight glutenin subunit 10 [Triticum aesti  (648 aa)
 initn:  97 init1:  97 opt: 405  Z-score: 349.1  bits: 74.5 E(): 2.3e-014
Smith-Waterman score: 426; 29.3% identity (42.4% similar) in 512 aa overlap (18-514:125-595)

                                      10        20          30             40        50
MaSP1                         MHHHHHHHHKLAVGGAGQGGYGGLGS--QGAGRGGLG-----GQGAGAAAAAAVGGA
                                               :: : :. :  ::.   : .     :::   .      :
gi|217 ARQYEQTVVPPKGGSFYPGETTPLQQLQQGIFWGTSSQTVQGYYPGVTSPRQGSYYPGQASPQQPGQGQQPGKWQE-PGQ
           90       100       110       120       130       140       150       160

               60        70        80          90       100       110       120
MaSP1  GQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQG--GYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGS
       ::  :   . :  :.:   :.:  .   ...   :::  ::   . : .:.     ::     .    :  : ::   .
gi|217 GQQWYYPTSLQQPGQGQQIGKGQQGYYPTSLQQPGQGQQGYYPTSLQHTGQRQQPVQGQQPEQGQQ-PGQWQQGYYPTSP
           170       180       190       200       210       220        230       240

      130       140       150       160       170       180       190       200
MaSP1  QGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQG
       :  :.:    :   .       : :: :.   . :  :.:    ::   :.    : . :: : .  .:     : : ::
gi|217 QQLGQGQQPRQWQQS-------GQGQQGHYPTSLQQPGQGQ---QGHYLASQQQPGQGQQGHYPASQQQP----GQGQQG
            250              260       270          280       290       300

      210       220       230       240       250       260       270       280
MaSP1  AGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGA
          :.    : . :: :   .::   . : : ::   :.    : . :: : .  .:     : : ::   ..   .: .
gi|217 HYPASQQQPGQGQQGHYP--ASQQ--EPGQGQQGQIPASQQQPGQGQQGHYPASLQQP----GQGQQGHYPTSLQQLGQG
      310       320         330         340       350       360           370       380

      290       300       310       320       330       340       350       360
MaSP1  GQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG----GL
        : :  :  .:  :.:   :::          : :: ::   . :  :.:   :::  .   ...   :::  :    .:
gi|217 QQTGQPGQKQQ-PGQGQQTGQGQQPEQEQQ-PGQGQQGYYPTSLQQPGQGQQQGQGQQGYYPTSLQQPGQGQQGHYPASL
              390        400        410       420       430       440       450

          370        380        390       400       410       420       430       440
MaSP1  GSQGAGRGGLGGQ-GAGA-AAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL
        . : :. :   : : :     .   : :: ::   . :  :.:   :::             : ::   . :  :.:
gi|217 QQPGQGQPGQRQQPGQGQHPEQGKQPGQGQQGYYPTSPQQPGQGQQLGQG-------------QQGYYPTSPQQPGQGQQ
      460       470       480       490       500                    510       520

            450       460       470       480       490       500       510       520
MaSP1  GGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAA
        :::  .   ..   .::.   : :.:  :.    :::  .   ..:   :::  .: :.: .:.:   :::
gi|217 PGQGQQGHCPTSPQQSGQAQQPGQGQQ-IGQVQQPGQGQQGYYPTSVQQPGQGQQSGQGQQ-SGQGHQPGQGQQSGQEQQ
         530       540       550        560       570       580        590       600


MaSP1  VGS

gi|217 GYDSPYHVSAEQQAASPMVAKAQQPATQLPTVCRMEGGDALSASQ
           610       620       630       640

>>gi|21779|gid|160|HMW glutenin-like protein product [Triticum aestivum]      (660 aa)
 initn: 100 init1: 100 opt: 400  Z-score: 344.9  bits: 73.8 E(): 4e-014
Smith-Waterman score: 433; 28.9% identity (42.0% similar) in 505 aa overlap (23-514:150-607)

                                 10        20        30        40        50        60
MaSP1                    MHHHHHHHHKLAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQG
                                               : :.:  :.    :::      ...   :::   : :.::
gi|217 QLQQGIFWGTSSQTVQGYYPSVTSPRQGSYYPGQASPQQPGQGQQ-PGKWQEPGQGQQWYYPTSLQQPGQGQQIGKGKQG
     110       120       130       140       150        160       170       180

             70        80        90         100       110          120       130
MaSP1  AGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGR--GGLGGQGAGAAAAAAVG---GAGQGGYGGLGSQGAGRGGLG
           .:   : :        : :: ::   . : .:.    . ::  : .     :   :  : ::   . :  :.:
gi|217 YYPTSLQQPGQGQQI-----GQGQQGYYPTSPQHTGQRQQPVQGQQIGQGQQPEQGQQPGQWQQGYYPTSPQQLGQGQQP
      190       200            210       220       230       240       250       260

       140       150       160       170       180       190       200       210
MaSP1  GQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAV
       ::   .       : :: :.   . :  :.:  :   :.    :     :: :.   ..:  :.:  :   :.
gi|217 GQWQQS-------GQGQQGHYPTSLQQPGQGQQGHYLASQQQPAQ----GQQGHYPASQQQPGQGQQGHYPASQQQP---
                  270       280       290       300           310       320

       220       230       240       250       260       270       280       290
MaSP1  GGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLG
        : :: :.   ..:  :.:    ::   :.    : . :: : .  .:       : ::   ..   .: . : :  :
gi|217 -GQGQQGHYPASQQEPGQG---QQGQIPASQQQPGQGQQGHYPASLQQP------GQQGHYPTSLQQLGQGQQIGQPGQK
      330       340          350       360       370             380       390

       300       310       320       330       340       350       360           370
MaSP1  SQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG----GLGSQGAGRGG
       .:  :.:   :::          : :: ::   . :  :.:   :::  .   ...   :::  :    .: . : :.:
gi|217 QQ-PGQGQQTGQGQQPEQEQQ-PGQGQQGYYPTSLQQPGQGQQQGQGQQGYYPTSLQQPGQGQQGHYPASLQQPGQGQGQ
     400        410        420       430       440       450       460       470

            380          390       400       410       420       430       440
MaSP1  LGG-QGAGAAAAAAVG---GAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGA
        :  :  : .     :   : :: ::   . :  :.:   :::             : ::   . :  :.:   :::  .
gi|217 PGQRQQPGQGQHPEQGQQPGQGQQGYYPTSPQQPGQGQQLGQG-------------QQGYYPTSPQQPGQGQQPGQGQQG
       480       490       500       510       520                    530       540

     450       460       470       480       490       500       510       520
MaSP1  AAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGS
           .   .::.   : :.:  :.    :::  .   ...   :::  .: :.: .:.:   :::
gi|217 HCPMSPQQTGQAQQLGQGQQ-IGQVQQPGQGQQGYYPTSLQQPGQGQQSGQGQQ-SGQGHQPGQGQQSGQEKQGYDSPYH
          550       560        570       580       590        600       610       620

gi|217 VSAEQQAASPMVAKAQQPATQLPTVCRMEGGDALSASQ
            630       640       650       660

>>gi|22090|gid|160|HMW glutenin subunit 1By9 [Triticum aestivum]              (705 aa)
 initn: 110 init1: 110 opt: 398  Z-score: 343.1  bits: 73.6 E(): 5.1e-014
Smith-Waterman score: 483; 30.2% identity (43.8% similar) in 527 aa overlap (15-514:162-652)

                                         10        20               30        40
MaSP1                            MHHHHHHHHKLAVGGAGQGGY-------GGLGSQGAGRGGLGGQGAGAAAAAAV
                                               : :: ::       .: :.::   ..:   : :
gi|220 QTVQGYYPSVSSPQQGPYYPGQASPQQPGQGQQPGKWQELGQGQQGYYPTSLHQSGQGQQGYYPSSLQQPGQGQQI----
             130       140       150       160       170       180       190

        50        60        70        80        90       100       110       120
MaSP1  GGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLG
        : :: ::   . :  :.:   :::  .   ..    ::    : :.:  :.:   :::   . .   .: :: ::   .
gi|220 -GQGQQGYYPTSLQQPGQGQQIGQGQQGYYPTSPQHPGQRQQPGQGQQ-IGQGQQLGQGRQIGQGQQ-SGQGQQGYYPTS
        200       210       220       230       240        250       260        270

       130       140       150       160       170       180       190       200
MaSP1  SQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQ
        :  :.:   ::   .       : :: ::   ..:  :.   : ::   :.    : . :: : .  .:     : : :
gi|220 PQQLGQGQQPGQWQQS-------GQGQQGYYPTSQQQPGQ---GQQGQYPASQQQPGQGQQGQYPASQQQ----PGQGQQ
          280       290              300          310       320       330           340

       210       220       230       240       250       260             270       280
MaSP1  GAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG------GLGSQGAGRGGLGGQGAGAAA
       :   :.    : . :: :  :.::   . : : :    :.    : . :: :       : :.::   ..:   : :
gi|220 GQYPASQQQPGQGQQGHY--LASQQ--QPGQGQQRHYPASLQQPGQGQQGHYTASLQQPGQGQQGHYPASLQQVGQGQQI
              350         360         370       380       390       400       410

             290       300         310         320          330       340       350
MaSP1  AAAVGGAGQGGYGGLGSQGAG--RGGLGGQGAGAAAAAAV--G---GAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVG
       .  .:   : : :    ::    .:   :::  .  .  .  :   : :: ::   . : .:.:   ::.
gi|220 GQ-LGQRQQPGQGQQTRQGQQLEQGQQPGQGQQTRQGQQLEQGQQPGQGQQGYYPTSPQQSGQGQQPGQSQQP-------
         420       430       440       450       460       470       480

          360       370       380       390       400         410       420
MaSP1  GAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQG--AGRGGLGGQGAGAAAAAAVG-----GAGQG
       : :: :: . . :  :.: : :.  ..    . :  ::    : :.:   . . : : ::   ..    :     : ::
gi|220 GQGQQGYYSSSLQQPGQG-LQGHYPASLQQPGQGHPGQRQQPGQGQQPEQGQQPGQGQQGYYPTSPQQPGQGKQLGQGQQ
      490       500        510       520       530       540       550       560

       430       440       450       460       470       480       490       500
MaSP1  GYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGR
       ::   . :  :.:   :::  .   ..   .::.   : :.:  :.    :::  .    ..  .:::  .: :.: .:.
gi|220 GYYPTSPQQPGQGQQPGQGQQGHCPTSPQQTGQAQQPGQGQQ-IGQVQQPGQGQQGYYPISLQQSGQGQQSGQGQQ-SGQ
       570       580       590       600        610       620       630       640

       510       520
MaSP1  GGLGGQGAGAAAAAAVGS
       :   :::
gi|220 GHQLGQGQQSGQEQQGYDNPYHVNTEQQTASPKVAKVQQPATQLPIMCRMEGGDALSASQ
         650       660       670       680       690       700

>>gi|21743|gid|160|high molecular weight glutenin subunit 1Ax1 [Triticum aes  (830 aa)
 initn:  81 init1:  81 opt: 392  Z-score: 337.5  bits: 72.8 E(): 1e-013
Smith-Waterman score: 426; 28.3% identity (43.2% similar) in 523 aa overlap (18-513:211-712)

                                      10        20        30         40        50
MaSP1                         MHHHHHHHHKLAVGGAGQGGYGGLGSQGAGRG-GLGGQGAGAAAAAAVGGAGQGGYG
                                               ::  :  ..::  :    ..:  :     : :  ::
gi|217 QQSGQEQPGYYPTSPWQPEQLQQPTQGQQRQQPGQGQQLRQGQQGQQSGQGQPRYYPTSSQQPGQLQQLAQGQQGQQPER
              180       190       200       210       220       230       240       250

         60        70        80          90       100         110       120       130
MaSP1  GLGSQGAGRGGLGGQGAGAAAAAAVGGAGQG--GYGGLGSQGAGRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAG
       :  .: .:.:   :::  .   .    .:::  ::  .. :  :.:  .:::  .   ..   .: ::.::   ..:  :
gi|217 GQQGQQSGQGQQLGQGQQGQQPGQKQQSGQGQQGYYPISPQQLGQGQQSGQGQLGYYPTSPQQSGQGQSGYYPTSAQQPG
              260       270       280       290       300       310       320       330

              140       150        160         170          180       190       200
MaSP1  R--GGLGGQGAGAAAAAAVGGAG-QGGYGGLGSQG--AGRGGLGGQ---GAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL
       .   .   :  :       .: : ::  .:  .:   .:.:   ::   :  ...   .:  ::  :   . :  :.
gi|217 QLQQSTQEQQLGQEQQDQQSGQGRQGQQSGQRQQDQQSGQGQQPGQRQPGYYSTSPQQLG-QGQPRYYPTSPQQPGQE--
              340       350       360       370       380       390        400

          210       220       230       240       250       260       270       280
MaSP1  GGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAA
         :          :  ::    :  .:  :.:  : :  : .  .   : :: ::   . : .:.:  :   ..   ..
gi|217 --QQPRQLQQPEQGQQGQQPEQGQQGQQPGQGEQGQQ-PGQGQQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQ
         410       420       430       440        450       460       470       480

          290       300        310       320       330       340              350
MaSP1  VGGAGQGGYGGLGSQGAGRG-GLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL------GGQGA-GAAAAAAVG-G
       .   .::   :  .::   : :  ::  : .      : :: ::   . : .:.         .:::  :   .. .  :
gi|217 LQQPAQGQQPGQEQQGQQPGQGQQGQQPGQGQQP---GQGQPGYYPTSPQQSGQEQQLEQWQQSGQGQPGHYPTSPLQPG
          490       500       510          520       530       540       550       560

         360       370       380       390          400         410       420       430
MaSP1  AGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG---GLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYG
        :: ::   . :  :.:   ::    . .      :::  :   : :.::   :.:   :::  .   ...  .:::
gi|217 QGQPGYYPTSPQQIGQGQQPGQLQQPTQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYYPTSLQQSGQGQQP
             570       580       590       600       610       620       630       640

              440       450       460       470       480       490       500       510
MaSP1  GLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL
       :  .:  :.: : :    ..        :: ::   ..:  :.:   ::   .       : :: ::   . : .:.:
gi|217 GQWQQ-PGQG-LPGYYPTSSLQPE---QGQQGYYPTSQQQPGQGPQPGQWQQS-------GQGQQGYYPTSPQQSGQGQQ
              650        660          670       680              690       700

              520
MaSP1  GGQGAGAAAAAAVGS
        ::
gi|217 PGQWLQPGQWLQSGYYLTSPQQLGQGQQPRQWLQPRQGQQGYYPTSPQQSGQGQQLGQGQQGYYPTSPQQSGQGQQGYDS
     710       720       730       740       750       760       770       780

>>gi|508732625|gid|160|high molecular weight glutenin subunit, partial [Trit  (794 aa)
 initn:  85 init1:  85 opt: 379  Z-score: 327.2  bits: 70.8 E(): 3.8e-013
Smith-Waterman score: 404; 27.5% identity (43.2% similar) in 516 aa overlap (18-513:184-676)

                                      10        20        30         40        50
MaSP1                         MHHHHHHHHKLAVGGAGQGGYGGLGSQGAGRG-GLGGQGAGAAAAAAVGGAGQGGYG
                                               ::  :  ..::  :    ..:  :     : :  ::
gi|508 QQSGQKQPGYYPTSPWQPEQLQQPTQGQQRQQPGQGQQLRQGQQGQQSGQGQPRYYPTSSQQPGQLQQLAQGQQGQQPER
           150       160       170       180       190       200       210       220

         60        70        80          90       100         110       120       130
MaSP1  GLGSQGAGRGGLGGQGAGAAAAAAVGGAGQG--GYGGLGSQGAGRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAG
       :  .: .:.:   :::  .   .    .:::  ::  .. :  :.:  .:::  .   ..   .: ::.::   ..:  :
gi|508 GQQGQQSGQGQQLGQGQQGQQPGQKQQSGQGQQGYYPISPQQLGQGQQSGQGQLGYYPTSPQQSGQGQSGYYPTSAQQPG
           230       240       250       260       270       280       290       300

              140       150        160         170          180       190       200
MaSP1  R--GGLGGQGAGAAAAAAVGGAG-QGGYGGLGSQG--AGRGGLGGQ---GAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL
       .   .   :  :       .: : ::  .:  .:   .:.:   ::   :  ...   .:  ::  :   . :  :.
gi|508 QLQQSTQEQQLGQEQQDQQSGQGRQGQQSGQRQQDQQSGQGQQPGQRQPGYYSTSPQQLG-QGQPRYYPTSPQQPGQE--
           310       320       330       340       350       360        370       380

          210       220       230       240       250       260       270       280
MaSP1  GGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAA
         :          :  ::    :  .:   : :  ::  : .  .   : :: ::   . : .:.:  :   ..   ..
gi|508 --QQPRQLQQPEQGQQGQQPEQGQQGQ-QQRQGEQGQQPGQGQQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQ
                390       400        410       420       430       440       450

          290       300         310       320       330       340       350       360
MaSP1  VGGAGQGGYGGLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG
       .   .::   :  .::   :.:   :::  .   ..   .::       .: .:.:    : .   ..    : :: ::
gi|508 LQQPAQGQQPGQEQQGQQPGQGQQPGQGQPGYYPTSPQQSGQEQQLEQWQQ-SGQG----QPGHYPTSPLQPGQGQPGYY
       460       470       480       490       500        510           520       530

            370       380       390          400         410       420       430
MaSP1  GLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG---GLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGA
         . :  :.:   ::    . .      :::  :   : :.::   :.:   :::  .   ...  .:::   :  .:
gi|508 PTSPQQIGQGQQPGQLQQPTQGQQGQQPGQGQQGQQPGEGQQGQQPGQGQQPGQGQPGYYPTSLQQSGQGQQPGQWQQ-P
            540       550       560       570       580       590       600       610

       440       450       460       470       480       490       500       510
MaSP1  GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGA
       :.:  :   ...         :: ::   ..:  :.:   ::   .       : :: ::   . : .:.:   ::
gi|508 GQGQPGYYPTSSLQPEQ----GQQGYYPTSQQQPGQGPQPGQWQQS-------GQGQQGYYPTSPQQSGQGQQPGQWLQP
             620           630       640       650              660       670       680

       520
MaSP1  AAAAAVGS

gi|508 GQWLQSGYYLTSPQQLGQGQQPRQWLQPRQGQQGYYPTSPQQSGQGQQLGQGQQGYYPTSPQQSGQGQQGYDSPYHVSAE
              690       700       710       720       730       740       750       760

>>gi|170743|gid|160|HMW glutenin subunit Ax2* [Triticum aestivum]             (815 aa)
 initn:  85 init1:  85 opt: 379  Z-score: 327.1  bits: 70.8 E(): 3.9e-013
Smith-Waterman score: 404; 27.5% identity (43.2% similar) in 516 aa overlap (18-513:205-697)

                                      10        20        30         40        50
MaSP1                         MHHHHHHHHKLAVGGAGQGGYGGLGSQGAGRG-GLGGQGAGAAAAAAVGGAGQGGYG
                                               ::  :  ..::  :    ..:  :     : :  ::
gi|170 QQSGQKQPGYYPTSPWQPEQLQQPTQGQQRQQPGQGQQLRQGQQGQQSGQGQPRYYPTSSQQPGQLQQLAQGQQGQQPER
          170       180       190       200       210       220       230       240

         60        70        80          90       100         110       120       130
MaSP1  GLGSQGAGRGGLGGQGAGAAAAAAVGGAGQG--GYGGLGSQGAGRGGLGGQGAGA--AAAAAVGGAGQGGYGGLGSQGAG
       :  .: .:.:   :::  .   .    .:::  ::  .. :  :.:  .:::  .   ..   .: ::.::   ..:  :
gi|170 GQQGQQSGQGQQLGQGQQGQQPGQKQQSGQGQQGYYPISPQQLGQGQQSGQGQLGYYPTSPQQSGQGQSGYYPTSAQQPG
          250       260       270       280       290       300       310       320

              140       150        160         170          180       190       200
MaSP1  R--GGLGGQGAGAAAAAAVGGAG-QGGYGGLGSQG--AGRGGLGGQ---GAGAAAAAAVGGAGQGGYGGLGSQGAGRGGL
       .   .   :  :       .: : ::  .:  .:   .:.:   ::   :  ...   .:  ::  :   . :  :.
gi|170 QLQQSTQEQQLGQEQQDQQSGQGRQGQQSGQRQQDQQSGQGQQPGQRQPGYYSTSPQQLG-QGQPRYYPTSPQQPGQE--
          330       340       350       360       370       380        390       400

          210       220       230       240       250       260       270       280
MaSP1  GGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAA
         :          :  ::    :  .:   : :  ::  : .  .   : :: ::   . : .:.:  :   ..   ..
gi|170 --QQPRQLQQPEQGQQGQQPEQGQQGQ-QQRQGEQGQQPGQGQQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQ
               410       420        430       440       450       460       470

          290       300         310       320       330       340       350       360
MaSP1  VGGAGQGGYGGLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG
       .   .::   :  .::   :.:   :::  .   ..   .::       .: .:.:    : .   ..    : :: ::
gi|170 LQQPAQGQQPGQEQQGQQPGQGQQPGQGQPGYYPTSPQQSGQEQQLEQWQQ-SGQG----QPGHYPTSPLQPGQGQPGYY
      480       490       500       510       520        530           540       550

            370       380       390          400         410       420       430
MaSP1  GLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYG---GLGSQGA--GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGA
         . :  :.:   ::    . .      :::  :   : :.::   :.:   :::  .   ...  .:::   :  .:
gi|170 PTSPQQIGQGQQPGQLQQPTQGQQGQQPGQGQQGQQPGEGQQGQQPGQGQQPGQGQPGYYPTSLQQSGQGQQPGQWQQ-P
           560       570       580       590       600       610       620       630

       440       450       460       470       480       490       500       510
MaSP1  GRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGAAAAAAVGGAGQGGYGGLGSQGAGRGGLGGQGAGA
       :.:  :   ...         :: ::   ..:  :.:   ::   .       : :: ::   . : .:.:   ::
gi|170 GQGQPGYYPTSSLQPEQ----GQQGYYPTSQQQPGQGPQPGQWQQS-------GQGQQGYYPTSPQQSGQGQQPGQWLQP
            640           650       660       670              680       690       700

       520
MaSP1  AAAAAVGS

gi|170 GQWLQSGYYLTSPQQLGQGQQPRQWLQPRQGQQGYYPTSPQQSGQGQQLGQGQQGYYPTSPQQSGQGQQGYDSPYHVSAE
             710       720       730       740       750       760       770       780



525 residues in 1 query   sequences
443551 residues in 1897 library sequences
 Scomplib [35.04]
 start: Wed Sep 16 15:04:38 2015 done: Wed Sep 16 15:04:38 2015
 Total Scan time:  0.000 Total Display time:  0.000

Function used was FASTA [version 35.04 Jan. 15, 2009]

User Query #1
>B. mori co-spinning module
MHHHHHHMRV KTFVILCCAL QYVAYTNANI NDFDEDYFGS DVTVQSSNTT DEIIRDASGA VIEEQITTKK MQRKNKNHGI LGKNEKMIKT FVITTDSDGN ESIVEEDVLM KTLSDGTVAQ SYVAADAGAY SQSGPYVSNC GYSTHQGYTS DFSTSAAVGA GSQLETMRKG EELFTGVVPI LVELDGDVNG HKFSVRGEGE GDATNGKLTL KFICTTGKLP VPWPTLVTTL TYGVQCFARY PDHMKQHDFF KSAMPEGYVQ ERTISFKDDG TYKTRAEVKF EGDTLVNRIE LKGIDFKEDG NILGHKLEYN FNSHNVYITA DKQKNGIKAN FKIRHNVEDG SVQLADHYQQ NTPIGDGPVL LPDNHYLSTQ SVLSKDPNEK RDHMVLLEFV TAAGITHGMD ELYKGLARSV SYGAGRGYGQ GAGSAASSVS SASSRSYDYS RRNVRKNCGI PRRQLVVKFR ALPCVNC

# fasta35.exe -q -H -B -m 9i -w 80 -E 1.0E-7 -d 20 C:\Windows\Temp\all3D87.tmp fasta/version15.fasta
FASTA searches a protein or DNA sequence data bank
 version 35.04 Jan. 15, 2009
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: C:\Windows\Temp\all3D87.tmp
  1>>>B. mori co-spinning module - 467 aa
Library: fasta/version15.fasta  443551 residues in  1897 sequences

 443551 residues in  1897 sequences
Statistics:  Expectation_n fit: rho(ln(x))= 3.2742+/-0.0036; mu= 22.2869+/- 0.191
 mean_var=65.2176+/-17.192, 0's: 1 Z-trim: 1  B-trim: 329 in 2/42
 Lambda= 0.158815
Algorithm: FASTA (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (15:-5) ktup: 2
 join: 37, opt: 25, open/ext: -10/-2, width:  16
 Scan time:  0.000
!! No sequences with E() < 0.000000


467 residues in 1 query   sequences
443551 residues in 1897 library sequences
 Scomplib [35.04]
 start: Wed Sep 16 15:06:22 2015 done: Wed Sep 16 15:06:22 2015
 Total Scan time:  0.000 Total Display time:  0.000

Function used was FASTA [version 35.04 Jan. 15, 2009]

User Query #1
>sp|P05790|FIBH_BOMMO Fibroin heavy chain OS=Bombyx mori GN=FIBH PE=1 SV=4
MRVKTFVILC CALQYVAYTN ANINDFDEDY FGSDVTVQSS NTTDEIIRDA SGAVIEEQIT TKKMQRKNKN HGILGKNEKM IKTFVITTDS DGNESIVEED VLMKTLSDGT VAQSYVAADA GAYSQSGPYV SNSGYSTHQG YTSDFSTSAA VGAGAGAGAA AGSGAGAGAG YGAASGAGAG AGAGAGAGYG TGAGAGAGAG YGAGAGAGAG AGYGAGAGAG AGAGYGAGAG AGAGAGYGAG AGAGAGAGYG AGAGAGAGAG YGAASGAGAG AGYGQGVGSG AASGAGAGAG AGSAAGSGAG AGAGTGAGAG YGAGAGAGAG AGYGAASGTG AGYGAGAGAG YGGASGAGAG AGAGAGAGAG AGYGTGAGYG AGAGAGAGAG AGAGYGAGAG AGYGAGYGVG AGAGYGAGYG AGAGSGAASG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGTGAGSGAG AGYGAGAGAG YGAGAGSGAA SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGYGAG AGAGYGAGAG AGYGAGAGVG YGAGAGSGAA SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGVG YGAGVGAGYG AGYGAGAGAG YGAGAGSGAA SGAGAGAGAG AGTGSSGFGP YVANGGYSRS DGYEYAWSSD FGTGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG YGAGVGVGYG AGYGAGAGAG YGAGAGSGAA SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGVGSGAG AGSGAGAGVG YGAGAGVGYG AGAGSGAASG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGVGYG AGVGAGYGAG YGAGAGAGYG AGAGSGAASG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG YGAGAGAGYG AGYGAGAGAG YGAGAGSGAA SGAGSGAGAG SGAGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGYGAGVG AGYGAGYGAG AGAGYGAGAG SGAASGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG VGYGAGYGAG AGAGYGAGAG SGAASGAGAG AGAGAGTGSS GFGPYVAHGG YSGYEYAWSS ESDFGTGSGA GAGSGAGAGS GAGAGSGAGA GSGAGYGAGV GAGYGAGYGA GAGAGYGAGA GSGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGY GAGYGAGAGA GYGAGAGSGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGY GAGVGAGYGA GYGAGAGAGY GAGAGSGAGS GAGAGSGAGA GSGAGAGSGA GVGSGAGAGS GAGAGSGAGA GSGAGAGYGA GYGAGAGAGY GAGAGSGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGVGYGA GVGAGYGAGY GAGAGAGYGA GAGSGAASGA GAGAGAGAGT GSSGFGPYVA NGGYSGYEYA WSSESDFGTG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG YGAGYGAGAG AGYGAGAGSG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGSG SGAGAGSGAG AGSGAGAGYG AGVGAGYGVG YGAGAGAGYG AGAGSGAASG AGAGAGAGAG TGSSGFGPYV AHGGYSGYEY AWSSESDFGT GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGVGA GYGAAYGAGA GAGYGAGAGS GAASGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGY GAGAGAGYGA GAGSGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GSGSGAGAGS GAGAGSGAGA GYGAGVGAGY GAGYGAGAGA GYGAGAGSGA GSGAGAGSGA GAGYGAGAGA GYGAGYGAGA GAGYGAGAGT GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGSGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GYGAGAGAGY GAGYGAGAGA GYGAGAGSGA GSGAGAGSGA GAGSGAGAGS GAGAGYGAGY GAGAGSGAAS GAGAGAGAGA GTGSSGFGPY VAHGGYSGYE YAWSSESDFG TGSGAGAGSG AGAGAGAGAG SGAGAGYGAG VGAGYGAGYG AGAGAGYGAG AGSGTGSGAG AGSGAGAGYG AGVGAGYGAG AGSGAAFGAG AGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGYGAG YGAGVGAGYG AGAGSGAASG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGYGAGVG AGYGAGYGAG AGAGYGAGAG SGAASGAGAG SGAGAGAGSG AGAGSGAGAG SGAGAGSGAG SGAGAGSGAG AGSGAGAGYG AGAGSGAASG AGAGAGAGAG TGSSGFGPYV ANGGYSGYEY AWSSESDFGT GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GYGAGVGAGY GAGYGAGAGA GYGAGAGSGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGAGS GAASGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGYGAGV GAGYGVGYGA GAGAGYGAGA GSGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGSGAGAGS GAGAGSGAGA GSGAGSGAGA GSGAGAGYGV GYGAGAGAGY GAGAGSGAGS GAGAGSGAGA GSGAGAGSGA GSGAGAGSGA GAGSGAGAGS GAGAGYGAGV GAGYGVGYGA GAGAGYGAGA GSGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGVGA GYGVGYGAGV GAGYGAGAGS GAASGAGAGS GAGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGYGA GVGAGYGAGA GVGYGAGAGA GYGAGAGSGA ASGAGAGAGS GAGAGTGAGA GSGAGAGYGA GAGSGAASGA GAGAGAGAGT GSSGFGPYVA NGGYSGYEYA WSSESDFGTG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG YGAGVGAGYG AGAGSGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGYG AGAGSGTGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGVGAG YGVGYGAGAG AGYGVGYGAG AGAGYGAGAG SGTGSGAGAG SGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGYGAG VGAGYGVGYG AGAGAGYGAG AGSGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGSGAGAG SGAGAGSGAG AGSGAGSGAG AGSGAGAGYG VGYGAGAGAG YGAGAGSGAG SGAGAGSGAG AGSGAGAGSG AGSGAGAGSG AGAGSGAGAG SGAGAGYGAG VGAGYGVGYG AGAGAGYGAG AGSGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGSG AGSGAGAGSG AGAGSGAGAG SGAGAGYGAG VGAGYGVGYG AGAGAGYGAG AGSGAASGAG AGAGAGAGTG SSGFGPYVAN GGYSGYEYAW SSESDFGTGS GAGAGSGAGA GSGAGAGYGA GYGAGVGAGY GAGAGVGYGA GAGAGYGAGA GSGAASGAGA GAGAGAGSGA GAGSGAGAGA GSGAGAGYGA GYGIGVGAGY GAGAGVGYGA GAGAGYGAGA GSGAASGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGYGA GYGAGVGAGY GAGAGVGYGA GAGAGYGAGA GSGAASGAGA GAGAGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGVGA GYGAGYGGAG AGYGAGAGSG AASGAGAGSG AGAGSGAGAG SGAGAGSGAG AGSGAGAGYG AGAGSGAASG AGAGAGAGAG TGSSGFGPYV NGGYSGYEYA WSSESDFGTG SGAGAGSGAG AGSGAGAGYG AGVGAGYGAG YGAGAGAGYG AGAGSGAASG AGAGSGAGAG SGAGAGSGAG AGSGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG AGYGAGVGAG YGAGYGAGAG AGYGAGAGSG AASGAGAGSG AGAGAGSGAG AGSGAGAGSG AGAGSGAGAG SGAGAGSGAG SGAGAGSGAG AGYGAGYGAG VGAGYGAGAG VGYGAGAGAG YGAGAGSGAA SGAGAGSGSG AGSGAGAGSG AGAGSGAGAG AGSGAGAGSG AGAGSGAGAG YGAGYGAGAG SGAASGAGAG AGAGAGTGSS GFGPYVANGG YSGYEYAWSS ESDFGTGSGA GAGSGAGAGS GAGAGYGAGV GAGYGAGYGA GAGAGYGAGA GSGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGYGA GAGAGYGAGA GVGYGAGAGA GYGAGAGSGA GSGAGAGSGS GAGAGSGSGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGYGAGYGI GVGAGYGAGA GVGYGAGAGA GYGAGAGSGA ASGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GYGAGAGVGY GAGAGSGAAS GAGAGSGAGA GSGAGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGSGAGA GSGAGAGYGA GYGAGVGAGY GAGAGYGAGY GVGAGAGYGA GAGSGAGSGA GAGSGAGAGS GAGAGSGAGA GSGAGAGSGA GSGAGAGYGA GAGAGYGAGA GAGYGAGAGS GAASGAGAGA GAGSGAGAGS GAGAGSGAGS GAGAGSGAGA GYGAGAGSGA ASGAGAGSGA GAGAGAGAGA GSGAGAGSGA GAGYGAGAGS GAASGAGAGA GAGTGSSGFG PYVANGGYSR REGYEYAWSS KSDFETGSGA ASGAGAGAGS GAGAGSGAGA GSGAGAGSGA GAGGSVSYGA GRGYGQGAGS AASSVSSASS RSYDYSRRNV RKNCGIPRRQ LVVKFRALPC VNC

# fasta35.exe -q -H -B -m 9i -w 80 -E 1.0E-7 -d 20 C:\Windows\Temp\allE1F4.tmp fasta/version15.fasta
FASTA searches a protein or DNA sequence data bank
 version 35.04 Jan. 15, 2009
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: C:\Windows\Temp\allE1F4.tmp
  1>>>sp|P05790|FIBH_BOMMO Fibroin heavy chain OS=Bombyx mori GN=FIBH - 5263 aa
Library: fasta/version15.fasta  443551 residues in  1897 sequences

 443551 residues in  1897 sequences
Statistics:  Expectation_n fit: rho(ln(x))= 5.2585+/-0.0051; mu= 7.3057+/- 0.259
 mean_var=186.9191+/-52.715, 0's: 1 Z-trim: 13  B-trim: 91 in 1/42
 Lambda= 0.093810
Algorithm: FASTA (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (15:-5) ktup: 2
 join: 53, opt: 41, open/ext: -10/-2, width:  16
 Scan time:  0.000

The best scores are:                                           opt z-sc E(1897) %_id  %_sim  alen
gi|27806257|gid|1565|collagen alpha-2(I) chain precurso (1364)  726 547.9  2e-025 0.261 0.503 1082
gi|71084277|gid|160|HMW glutenin x-type subunit Bx7 pre ( 795)  358 280.8 1.5e-010 0.257 0.401  654
gi|736319|gid|160|glutenin [Triticum aestivum]          ( 838)  300 238.2 3.5e-008 0.261 0.387  710
gi|22090|gid|160|HMW glutenin subunit 1By9 [Triticum ae ( 705)  290 231.5 8.2e-008 0.248 0.409  553

>>>sp|P05790|FIBH_BOMMO, 5263 aa vs fasta/version15.fasta library

>>gi|27806257|gid|1565|collagen alpha-2(I) chain precursor [Bos taurus]       (1364 aa)
 initn: 1292 init1: 489 opt: 726  Z-score: 547.9  bits: 115.7 E(): 2e-025
Smith-Waterman score: 773; 26.1% identity (50.3% similar) in 1082 aa overlap (4052-5117:35-1109)

            4020      4030      4040      4050      4060      4070      4080      4090
sp|P05 AGAGYGAGAGSGAASGAGAGAGAGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAG
                                               ::  .  :  .  :  . .:  .  :  .  :  .  : :
gi|278       MLSFVDTRTLLLLAVTSCLATCQSLQEATARKGPSGDRGPRGERGPPGPPGRDGDDGIPGPPGPPGPPGPPGLG
                     10        20        30        40        50        60        70

            4100       4110      4120      4130      4140      4150       4160
sp|P05 AGYGAGVGA-GYGAGYGGAGAGYGAGAGSGAASGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGA-GYGAGAGSGAAS
       ....:   : : : :  :  .  :  ..::: .  :  .  :  .  :  . ::: .  :  . ::  :. .  :  .
gi|278 GNFAAQFDAKGGGPGPMGLMGPRGPPGASGAPGPQGFQGPPGEPGEPGQTGPAGARGPPGPPGKAGEDGHPGKPGRPGER
           80        90       100       110       120       130       140       150

    4170      4180      4190        4200      4210      4220      4230      4240
sp|P05 GAGAGAGAGAGTGSSGFGPYVNG--GYSGYEYAWSSESDFGTGSGAGAGSGAGAGSGAGAGYGAGVGAGYGAGYGAGAGA
       :. .  :: .  :. :. :  .:  :..: .   .. .  :. .  :: .  :. . .::    :  .  ::   ::: .
gi|278 GVVGPQGARGFPGTPGL-PGFKGIRGHNGLDGLKGQPGAPGVKGEPGAPGENGTPGQTGARGLPGERGRVGAPGPAGARG
          160       170        180       190       200       210       220       230

      4250      4260      4270      4280      4290       4300      4310       4320
sp|P05 GYGAGAGSGAASGAGAGSGAGAGSGAGAGSGAGAGSGAGSGAGAGSGAGAG-SGAGAGSGAGAGSGAGAGSGA-GAGYGA
       . :. .  : :.  :...  :  .. :  .  :  .. : .. ::  . .:  : ..  :  .. ::..  :: ::.
gi|278 SDGSVGPVGPAGPIGSAGPPGFPGAPGPKGELGPVGNPGPAGPAGPRGEVGLPGLSGPVGPPGNPGANGLPGAKGAAGLP
           240       250       260       270       280       290       300       310

        4330      4340      4350      4360      4370      4380       4390      4400
sp|P05 GVGAGYGAGYGAGAGAGYGAGAGSGAASGAGAGSGAGAGAGSGAGAGSGAGAGSGAGAGSGA-GAGSGAGAGSGAGSGAG
       ::... :     :  .  ::....:: . .:  . ::. . ::  .  :: .  :  . ::  :  ...:  . ::  .
gi|278 GVAGAPGLPGPRGIPGPVGAAGATGARGLVGEPGPAGSKGESGNKGEPGAVGQPGPPGPSGEEGKRGSTGEIGPAGPPGP
           320       330       340       350       360       370       380       390

         4410      4420      4430      4440       4450      4460       4470      4480
sp|P05 AGSGAGAGYGAGYGAGVGAGYGAGAGVGYGAGAGAGY-GAGAGSGAASGAGAGSGSG-AGSGAGAGSGAGAGSGAGAGAG
        :  .. :  .  ::   ::  . :: . :: . ::  : .. ::  .  :  .  :  :: .. : ..  :  .  :
gi|278 PGLRGNPGSRGLPGADGRAGVMGPAG-SRGATGPAGVRGPNGDSGRPGEPGLMGPRGFPGSPGNIGPAGKEGPVGLPGID
           400       410        420       430       440       450       460       470

           4490       4500      4510       4520       4530      4540      4550
sp|P05 SGAGAGSGAGA-GSGAGAGYGAGYGAGAGSGAASGAG-AG-AGAGAGTGSSGFGPYVANGGYSGYEYAWSSESDFGTGSG
       .  :  . ::: :  .. :. .  : ..  : :.  : :: ::: .. : .: .   .  : .: . . . ..  :  .
gi|278 GRPGPIGPAGARGEPGNIGFPGPKGPSGDPGKAGEKGHAGLAGARGAPGPDGNNGAQGPPGLQGVQGGKGEQGPAGPPGF
            480       490       500       510       520       530       540       550

    4560      4570      4580      4590      4600      4610      4620      4630
sp|P05 AGAGSGAGAGSGAGAGYGAGVGAGYGAGYGAGAGAGYGAGAGSGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAG
        :  . ::... ::     :. . .:    ::: .  :  . :::.. .:  .. : ..  :  .. :  . .:: . ::
gi|278 QGLPGPAGTAGEAGKPGERGIPGEFGLPGPAGARGERGPPGESGAAGPTGPIGSRGPSGPPGPDGNKGEPGVVGAPGTAG
            560       570       580       590       600       610       620       630

    4640      4650      4660      4670      4680      4690      4700      4710
sp|P05 AGSGAGAGSGAGAGYGAGYGAGAGAGYGAGAGVGYGAGAGAGYGAGAGSGAGSGAGAGSGSGAGAGSGSGAGSGAGAGSG
        .. .:  .  ::. :   : :  .  :  . .: . :  .. :: .. :: . :::..  :  :: .. :: ..  ::
gi|278 PSGPSGLPGERGAA-GIPGGKGEKGETGLRGDIG-SPGRDGARGAPGAIGAPGPAGANGDRGE-AGPAGPAGPAGPRGSP
            640        650       660        670       680       690        700

    4720      4730      4740      4750      4760      4770      4780      4790
sp|P05 AGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGYGIGVGAGYGAGAGVGYGAGAGAGYGAGAGSGAASGAGAGSG
       .  :  . :: .. ::  :::..  :: .  :.    : .  ::    .::.   : ..  : ... :.:.  :: .  :
gi|278 GERGEVGPAGPNGFAGP-AGAAGQPGAKGERGTKGPKGENGPVGPTGPVGAAGPSGPNGPPGPAGSRGDGGPPGATGFPG
     710       720        730       740       750       760       770       780

    4800      4810      4820      4830      4840      4850      4860      4870
sp|P05 AGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGAGVGYGAGAGSGAASGAGAGSGAG
       :.. .:  . :: ..  :  . .:  .  :  . .:  . ::  ..::  .  :  .  :  . ::  .. :  .  ::
gi|278 AAGRTGPPGPSGISGPPGPPGPAGKEGLRGPRGDQGPVGRSGETGASGPPGFVGEKGPSGEPGTAGPPGTPGPQGLLGAP
      790       800       810       820       830       840       850       860

    4880      4890      4900      4910      4920      4930      4940        4950
sp|P05 AGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGSGAGAGSGAGAGYGAGYGAGVGAGYGAGA--GYGAGYGVGAGAG
       .  :  .. :  .  :. ::: .  :  . ::  .. :  .. :  .  ::   ::  .. :  .  :  .  :  .  :
gi|278 GFLGLPGSRGERGLPGV-AGSVGEPGPLGIAGPPGARGPPGNVGNPGVNGAPGEAGRDGNPGNDGPPGRDGQPGHKGERG
      870       880        890       900       910       920       930       940

      4960      4970      4980      4990      5000      5010      5020      5030
sp|P05 YGAGAGSGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGSGAGAGYGAGAGAGYGAGAGAGYGAGAGSGAASGAG
       : ..::  ...:: . .:  .  :  .. :  . .:: . .:: .  : .   :  .  :  .  :  .  :  . .:
gi|278 YPGNAGPVGAAGAPGPQGPVGPVGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLPGLKGHNGLQ
       950       960       970       980       990      1000      1010      1020

      5040      5050      5060      5070       5080       5090      5100      5110
sp|P05 AGAGAGSGAGAGSGAGAGSGAGSGAGAGSGAGAGYGAGAGS-GAASGAGA-GSGAGAGAGAGAGAGSGAGAGSGAGAGYG
       .  : ..  :  .. :: . ::  . :: .. ::  .  :. ::.. ::  :: .. : ..  :  .  :  . .:.::
gi|278 GLPGLAGHHGDQGAPGAVGPAGPRGPAGPSGPAGKDGRIGQPGAVGPAGIRGSQGSQGPAGPPGPPGPPGPPGPSGGGYE
      1030      1040      1050      1060      1070      1080      1090      1100

        5120      5130      5140      5150      5160      5170      5180      5190
sp|P05 AGAGSGAASGAGAGAGAGTGSSGFGPYVANGGYSRREGYEYAWSSKSDFETGSGAASGAGAGAGSGAGAGSGAGAGSGAG
        :
gi|278 FGFDGDFYRADQPRSPTSLRPKDYEVDATLKSLNNQIETLLTPEGSRKNPARTCRDLRLSHPEWSSGYYWIDPNQGCTMD
      1110      1120      1130      1140      1150      1160      1170      1180

>>gi|71084277|gid|160|HMW glutenin x-type subunit Bx7 precursor [Triticum ae  (795 aa)
 initn: 126 init1:  87 opt: 358  Z-score: 280.8  bits: 65.5 E(): 1.5e-010
Smith-Waterman score: 390; 25.7% identity (40.1% similar) in 654 aa overlap (1849-2461:119-764)

     1810      1820      1830      1840      1850      1860      1870      1880
sp|P05 SGAGAGAGAGAGTGSSGFGPYVAHGGYSGYEYAWSSESDFGTGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGS
                                               : .:   .:.:   :.    :.:    . .   .:    :
gi|710 YPSETTPSQQLQQMIFWGIPALLRRYYPSVTSSQQGSYYPGQASPQQSGQGQQPGQEQQPGQGQQDQQPGQRQQGYYPTS
       80        90       100       110       120       130       140       150

     1890      1900      1910      1920      1930      1940      1950       1960
sp|P05 GAGAGYGAGVGAGYGAAYGAGAGAGYGAGAGSGAASGAGAGSGAGAGSGAGAGSGAGAGSGA-GAGSGAGAGSGAGAGSG
           : :  .: :  . : ..   :    ::.:  :: :  .:    :   .:.:   :.:  :    .   ::     :
gi|710 PQQPGQGQQLGQGQPGYYPTSQQPGQKQQAGQGQQSGQGQ-QGYYPTSPQQSGQGQQPGQGQPGYYPTSPQQSGQWQQPG
      160       170       180       190        200       210       220       230

      1970      1980       1990       2000      2010      2020      2030      2040
sp|P05 AGAGSGAGAGSGAGA-GSGAGAGYGAGAGA-GYGAGAGSGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGSGS-
        :   : :  :: :  :.  : :   : :  ::   . .  :.:  .:.:   :    .    :  .  : :.  :.:.
gi|710 QGQQPGQGQQSGQGQQGQQPGQGQRPGQGQQGYYPISPQQPGQGQQSGQGQ-PGYYPTSLRQPGQWQQPGQGQQPGQGQQ
       240       250       260       270       280        290       300       310

         2050      2060       2070      2080      2090      2100       2110      2120
sp|P05 GAGAGSGAGAGSGAGAGYGAGVGA-GYGAGYGAGAGAGYGAGAGSGAGSGAGAGSGAGAGY-GAGAGAGYGAGYGAGAGA
       :   :.:  .:.:  . : ...   : :   : :  . : ..  :  :.  : :.  : :  :    .   .: :   :
gi|710 GQQPGQGQQSGQGQQGYYPTSLQQPGQGQQLGQGQPGYYPTSQQSEQGQQPGQGKQPGQGQQGYYPTSPQQSGQGQQLGQ
        320       330       340       350       360       370       380       390

            2130      2140       2150      2160       2170      2180      2190
sp|P05 GY-GAGAGTGAGSGAGAGSGAGA-GSGAGAGSGAGAGSGAGAG-SGAGAGSGAGSGSGAGAGSGAGAGSGAGAGSGAGAG
       :  :    .   :: :  :: :  :    . . .: :.  : : ::    :   ::.:   :.:  .:.:   :.  : :
gi|710 GQPGYYPTSPQQSGQGQQSGQGQQGYYPTSPQQSGQGQQPGQGQSGYFPTSRQQSGQGQQPGQGQQSGQGQ-QGQQPGQG
        400       410       420       430       440       450       460        470

    2200               2210       2220      2230      2240               2250      2260
sp|P05 SGAGAGSG---------AGAGYGAGAGA-GYGAGYGAGAGAGYGAGAGSGAG---------SGAGAGSGAGAGSGAGAGS
       . :   ..         ::     : :  ::        :    .: .. .:         :    :.    ..:   ..
gi|710 QQAYYPTSSQQSRQRQQAGQWQRPGQGQPGYYPTSPQQPGQEQQSGQAQQSGQWQLVYYPTSPQQPGQLQQPAQGQQPAQ
         480       490       500       510       520       530       540       550

             2270             2280      2290      2300      2310       2320       2330
sp|P05 GAGAGYGAGYGAGAGSGA-------ASGAGAGAGAGAGTGSSGFGPYVAHGGYSGYE-YAWSSESDFGTGS-GAGAGSGA
       :  ..     : .  ::        .:    :     . :..:. :   . . .: . :  .: .. : :. :    :
gi|710 GQQSAQEQQPGQAQQSGQWQLVYYPTSPQQPGQLQQPAQGQQGYYPTSPQQSGQGQQGYYPTSPQQSGQGQQGYYPTSPQ
         560       570       580       590       600       610       620       630

            2340      2350      2360      2370       2380      2390      2400      2410
sp|P05 GAGAGAGAGSGAGAGYGAGVGAGYGAGYGAGAGAGYGAGAGS-GTGSGAGAGSGAGAGYGAGVGAGYGAGAGSGAAFGAG
        .: :   :.:     :     ::       .: :   : :. :    .   :: :   :     :     :.:   :
gi|710 QSGQGQQPGQGQQPRQGQ---QGYYPISPQQSGQGQQPGQGQQGYYPTSPQQSGQGQQPGHEQQPGQWLQPGQGQQ-GYY
         640       650          660       670       680       690       700        710

             2420      2430      2440       2450        2460      2470      2480
sp|P05 AGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGA-GYGAG--VGAGYGAGAGSGAASGAGAGSGAGAGSGAGAGS
         ..  :: :  :: :  .:    :    :.:   : :  ::..   :.: : :
gi|710 PTSSQQSGQGHQSGQGQ-QGYYPTSLWQPGQGQQPGQGQQGYASPYHVSAEYQAARLKVAKAQQLAAQLPAMCRLEGSDA
             720        730       740       750       760       770       780       790

      2490      2500      2510      2520      2530      2540      2550      2560
sp|P05 GAGAGSGAGAGSGAGAGYGAGVGAGYGAGYGAGAGAGYGAGAGSGAASGAGAGSGAGAGAGSGAGAGSGAGAGSGAGAGS

gi|710 LSTRQ


>>gi|736319|gid|160|glutenin [Triticum aestivum]                              (838 aa)
 initn:  51 init1:  51 opt: 300  Z-score: 238.2  bits: 57.7 E(): 3.5e-008
Smith-Waterman score: 354; 26.1% identity (38.7% similar) in 710 aa overlap (3823-4515:126-809)

           3790      3800      3810      3820      3830      3840      3850      3860
sp|P05 AGAGAGTGSSGFGPYVANGGYSGYEYAWSSESDFGTGSGAGAGSGAGAGSGAGAGYGAGYGAGVGAGYGAGAGVGYGAGA
                                               : .:    :.:   : :   : :   ::   .    :
gi|736 YPGETTPPQQLQQRIFWGIPALLKRYYPSVTSPQQVSYYPGQASPQRPGQGQQPGQGQQSGQG-QQGYYPTSPQQPGQWQ
          90       100       110       120       130       140        150       160

           3870      3880      3890       3900      3910      3920      3930       3940
sp|P05 GAGYGAGAGSGAASGAGAGAGAGAGSGAGAGSGA-GAGAGSGAGAGYGAGYGIGVGAGYGAGAGVGYGAGAGAGY-GAGA
           :   :   .:    :     ..:   :.:  :   :.:  . : ..  .  :     . :   :   : :  :
gi|736 QPEQGQ-PGYYPTSPQQPGQLQQPAQGQQPGQGQQGRQPGQGQPGYYPTSSQLQPGQLQQPAQG-QQGQQPGQGQQGQQP
          170        180       190       200       210       220        230       240

             3950      3960      3970       3980      3990       4000      4010
sp|P05 GSGAASGAGAGSGAGAGSGAGAGSGAGAGSGAGAGS-GAGAGSGAGAGYGA-GYGAGVGAGYGAGAGVGYGAGAGAGYGA
       :.:   : :  .:   :.:   :.:   :.  : :. :    :   .: :  ::        : : . ::   .    :
gi|736 GQGQQPGQGQ-QGQQPGQGQQPGQGQ-QGQQLGQGQQGYYPTSLQQSGQGQPGYYPTSLQQLGQGQS-GYYPTSPQQPGQ
            250        260        270       280       290       300        310

     4020      4030       4040       4050      4060       4070       4080      4090
sp|P05 GAGSGAASGAGAGAGAGAGA-GSGAGAGS-GAGAGSGAGAGSGA-GAGSGAGAGSGAGA-GSGAGAGSGAGAGSGAGAGY
       :   :  .  . :     :  :.  : :. :   :.:   :.:  :    .   :: :  :    ...    ..  : :
gi|736 GQQPGQLQQPAQGQQPEQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSSQQPTQSQQPGQGQ
     320       330       340       350       360       370       380       390

          4100      4110       4120      4130       4140      4150      4160      4170
sp|P05 -GAGVGAGYGAGYGGAGAGYGAGA-GSGAASGAGAGSGA-GAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGAGSGAASGA
        :  :: :  :   : :   : :  :   .:   .:.:  :    .   :: :   :    :. :   :   :.:   :
gi|736 QGQQVGQGQQAQQPGQGQQPGQGQPGYYPTSPLQSGQGQPGYYLTSPQQSGQGQQPGQLQQSAQGQK-GQQPGQGQQPGQ
     400       410       420       430       440       450       460        470

            4180      4190      4200      4210      4220      4230        4240
sp|P05 GAGAGAGAGTGSSGFGPYVNGGYSGYEYAWSSESDFGTGSGAGAGSGAGAGSGAGAGYGAG-VGAGYGA-GYGAGAGAGY
       :   :   : :..:  :  . :  :: :  : ..   .:.:   :.    :.:  . : .. .  : :  ::   .
gi|736 GQ-QGQQPGQGQQGQQP--GQGQPGY-YPTSPQQ---SGQGQQPGQWQQPGQGQPGYYPTSPLQPGQGQPGYDPTSPQQP
      480        490         500           510       520       530       540       550

    4250      4260       4270      4280      4290      4300      4310       4320
sp|P05 GAGAGSGAASGAGAGS-GAGAGSGAGAGSGAGAGSGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGS-GAGAGYGA-G
       : :   :  .  . :. :   ..:  . . : . .:   . :  .:   :.:   :.  : :.  . :. :   : :  :
gi|736 GQGQQPGQLQQPAQGQQGQQLAQGQQGQQPAQVQQGQQPAQGQ-QGQQLGQGQ-QGQQPGQGQQPAQGQQGQQPGQGQQG
             560       570       580       590        600        610       620

       4330      4340      4350      4360      4370      4380      4390      4400
sp|P05 VGAGYGAGYGAGAGAGYGAGAGSGAASGAGAGSGAGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGSGAGAG
          : :   : :    : .   :   :: :   :     :.    :.:     :    :    :.:   :    :    :
gi|736 QQPGQGQQPGQGQPWYYPT---SPQESGQGQQPGQWQQPGQWQQPGQGQ---PGYYLTSPLQLGQGQ-QGYYPTSLQQPG
     630       640          650       660       670          680       690        700

       4410      4420      4430      4440      4450      4460      4470      4480
sp|P05 SGAGAGYGAGYGAGVGAGYGAGAGVGYGAGAGAGYGAGAGSGAASGAGAGSGSGAGSGAGAGSGAGAGSGAGAGAGSGAG
       .:   :     : :   ::   .    : :   :     :.:  .:    : . .:.:   :.    :.:   :    .
gi|736 QGQQPGQWQQSGQG-QHGYYPTSPQLSGQGQRPGQWLQPGQGQ-QGYYPTSPQQSGQGQQLGQWLQPGQGQ-QGYYPTSL
            710        720       730       740        750       760       770

       4490       4500      4510      4520      4530      4540      4550      4560
sp|P05 AGSGAGAGSGAGA-GYGAGYGAGAGSGAASGAGAGAGAGAGTGSSGFGPYVANGGYSGYEYAWSSESDFGTGSGAGAGSG
         .: :  :: :  :: ..: ...   :::
gi|736 QQTGQGQQSGQGQQGYYSSYHVSVEHQAASLKVAKAQQLAAQLPAMCRLEGGDALSASQ
     780       790       800       810       820       830

>>gi|22090|gid|160|HMW glutenin subunit 1By9 [Triticum aestivum]              (705 aa)
 initn:  72 init1:  72 opt: 290  Z-score: 231.5  bits: 56.2 E(): 8.2e-008
Smith-Waterman score: 336; 24.8% identity (40.9% similar) in 553 aa overlap (1184-1724:118-656)

          1150      1160      1170      1180      1190      1200      1210      1220
sp|P05 ASGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGVGYGAGYGAGAGAGYGAGAGSGAASGAGAGAGA
                                               :..: .  ::  . ..   . :  : .:    : :   :
gi|220 PVAVSQVVRQYEQTVVPPKGGSFYPGETTPLQQLQQVIFWGTSSQTVQGYYPSVSSPQQGPYYPGQASPQQPGQGQQPGK
        80        90       100       110       120       130       140       150

            1230      1240       1250      1260       1270      1280       1290
sp|P05 GA--GTGSSGFGPYVAHGGYSGYE-YAWSSESDFGTGSGAGAGS-GAGAGSGAGAGSGAGAGSGA-GYGAGVGAGYGAGY
           : :..:. :   : . .: . :  :: .. : :.  : :. :    :    :.:   :.:  ::        :
gi|220 WQELGQGQQGYYPTSLHQSGQGQQGYYPSSLQQPGQGQQIGQGQQGYYPTSLQQPGQGQQIGQGQQGYYPTSPQHPGQRQ
       160       170       180       190       200       210       220       230

     1300      1310      1320       1330      1340      1350      1360      1370
sp|P05 GAGAGAGYGAGAGSGAGSGAGAGSGAGAGS-GAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGYGAG
         : :   : :   : :   : :. .: :. :    :    :.:   :.   .:.:   :    . .  : :  . : :.
gi|220 QPGQGQQIGQGQQLGQGRQIGQGQQSGQGQQGYYPTSPQQLGQGQQPGQWQQSGQGQ-QGYYPTSQQQPGQGQQGQYPAS
       240       250       260       270       280       290        300       310

      1380      1390      1400      1410      1420      1430      1440         1450
sp|P05 AGAGYGAGAGSGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGYGAGV---GAGYGAGYGA
             .  :.  .:    :.:    .:   .:    :.:    .:   .:    :.:    : :..   : :  . : :
gi|220 QQQPGQGQQGQYPASQQQPGQGQ---QGQYPASQQQPGQGQ---QGHYLASQQQPGQGQQRHYPASLQQPGQGQQGHYTA
        320       330          340       350          360       370       380       390

         1460      1470      1480       1490      1500      1510      1520       1530
sp|P05 GAGAGYGAGAGSGAGSGAGAGSGAGAGS-GAGAGSGAGVGSGAGAGSGAGAGSGAGAGSGAGAGYGAGYGAGAGA-GYGA
       .      .  :   .:   .:.:   :. :     : :  .  :     :   : :  .  :     :   : :  ::
gi|220 SLQQPGQGQQGHYPASLQQVGQGQQIGQLGQRQQPGQGQQTRQGQQLEQGQQPGQGQQTRQGQQLEQGQQPGQGQQGYYP
              400       410       420       430       440       450       460       470

           1540      1550       1560      1570      1580      1590      1600      1610
sp|P05 GAGSGAGSGAGAGSGAGAGSGA-GAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGVGYGAGVGAGYGAGYGAGAGAGYGAG
        . . .:.:   :..   :.:  :  :..    : :  .:   .:    :.:   :     : :     :   : :
gi|220 TSPQQSGQGQQPGQSQQPGQGQQGYYSSSLQQPGQGL-QGHYPASLQQPGQGHP-GQRQQPGQGQQPEQGQQPGQGQ---
              480       490       500        510       520        530       540

            1620      1630      1640      1650      1660      1670      1680      1690
sp|P05 AGSGAASGAGAGAGAGAGTGSSGFGPYVANGGYSGYEYAWSSESDFGTGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGY
        :   .:    : :   : :..:. :   .   .: . . ....   : :   .:..   :.:   :.    :.:   ::
gi|220 QGYYPTSPQQPGQGKQLGQGQQGYYPTSPQQPGQGQQPGQGQQGHCPT-SPQQTGQAQQPGQGQQIGQVQQPGQGQ-QGY
         550       560       570       580       590        600       610       620

            1700      1710      1720      1730      1740      1750      1760      1770
sp|P05 GAGYGAGAGAGYGAGAGSGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGAGSGAGSGSGAGAGSGAGA
              .: :  .: :. .:.:   :.:  .:
gi|220 YPISLQQSGQGQQSGQGQQSGQGHQLGQGQQSGQEQQGYDNPYHVNTEQQTASPKVAKVQQPATQLPIMCRMEGGDALSA
           630       640       650       660       670       680       690       700



5263 residues in 1 query   sequences
443551 residues in 1897 library sequences
 Scomplib [35.04]
 start: Wed Sep 16 15:14:43 2015 done: Wed Sep 16 15:14:44 2015
 Total Scan time:  0.000 Total Display time:  1.000

Function used was FASTA [version 35.04 Jan. 15, 2009]

>MaSP2 15-mer
MHHHHHHHHK LAVGGYGPGQ QGPGGYGPGQ QGPSGPGSAA AAAAAAVGGY GPGQQGPGGY GPGQQGPSGP GSAAAAAAAA VGGYGPGQQG PGGYGPGQQG PSGPGSAAAA AAAAVGGYGP GQQGPGGYGP GQQGPSGPGS AAAAAAAAVG GYGPGQQGPG GYGPGQQGPS GPGSAAAAAA AAVGGYGPGQ QGPGGYGPGQ QGPSGPGSAA AAAAAAVGGY GPGQQGPGGY GPGQQGPSGP GSAAAAAAAA VGGYGPGQQG PGGYGPGQQG PSGPGSAAAA AAAAVGGYGP GQQGPGGYGP GQQGPSGPGS AAAAAAAAVG GYGPGQQGPG GYGPGQQGPS GPGSAAAAAA AAVGGYGPGQ QGPGGYGPGQ QGPSGPGSAA AAAAAAVGGY GPGQQGPGGY GPGQQGPSGP GSAAAAAAAA VGGYGPGQQG PGGYGPGQQG PSGPGSAAAA AAAAVGGYGP GQQGPGGYGP GQQGPSGPGS AAAAAAAAVG GYGPGQQGPG GYGPGQQGPS GPGSAAAAAA AAVGS

# fasta35.exe -q -H -B -m 9i -w 80 -E 1.0E-7 -d 20 C:\Windows\Temp\allA16C.tmp fasta/version15.fasta
FASTA searches a protein or DNA sequence data bank
 version 35.04 Jan. 15, 2009
Please cite:
 W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

Query: C:\Windows\Temp\allA16C.tmp
  1>>>MaSP2 15-mer - 525 aa
Library: fasta/version15.fasta  443551 residues in  1897 sequences

 443551 residues in  1897 sequences
Statistics:  Expectation_n fit: rho(ln(x))= 5.1769+/-0.00664; mu= 4.7971+/- 0.347
 mean_var=225.0527+/-69.614, 0's: 1 Z-trim: 28  B-trim: 0 in 0/43
 Lambda= 0.085493
Algorithm: FASTA (3.5 Sept 2006) [optimized]
Parameters: BL50 matrix (15:-5) ktup: 2
 join: 37, opt: 25, open/ext: -10/-2, width:  16
 Scan time:  0.000

The best scores are:                                           opt z-sc E(1897) %_id  %_sim  alen
gi|27806257|gid|1565|collagen alpha-2(I) chain precurso (1364)  786 545.8 2.6e-025 0.394 0.570  611
gi|736319|gid|160|glutenin [Triticum aestivum]          ( 838)  638 448.9 6.5e-020 0.336 0.462  539
gi|288860106|gid|160|high molecular weight glutenin sub ( 848)  584 412.8 6.6e-018 0.328 0.463  549
gi|21751|gid|160|high molecular weight glutenin subunit ( 648)  489 350.4  2e-014 0.316 0.455  538
gi|21779|gid|160|HMW glutenin-like protein product [Tri ( 660)  480 344.4 4.3e-014 0.324 0.458  555
gi|71084277|gid|160|HMW glutenin x-type subunit Bx7 pre ( 795)  471 337.7  1e-013 0.326 0.491  601
gi|21743|gid|160|high molecular weight glutenin subunit ( 830)  459 329.6 2.9e-013 0.320 0.443  575
gi|22090|gid|160|HMW glutenin subunit 1By9 [Triticum ae ( 705)  457 328.8 3.1e-013 0.326 0.475  564
gi|508732625|gid|160|high molecular weight glutenin sub ( 794)  426 307.7 4.7e-012 0.319 0.431  562
gi|170743|gid|160|HMW glutenin subunit Ax2* [Triticum a ( 815)  426 307.6 4.8e-012 0.319 0.431  562

>>>MaSP2, 525 aa vs fasta/version15.fasta library

>>gi|27806257|gid|1565|collagen alpha-2(I) chain precursor [Bos taurus]       (1364 aa)
 initn: 586 init1: 211 opt: 786  Z-score: 545.8  bits: 112.0 E(): 2.6e-025
Smith-Waterman score: 945; 39.4% identity (57.0% similar) in 611 aa overlap (14-516:476-1060)

                                          10         20          30         40
MaSP2                             MHHHHHHHHKLAVGGYGP-GQQG-PGGYG-PGQQGPSG-PGSAA----AAAAA
                                               :  :: : .: ::. : :: .:::: ::.:.    :. :.
gi|278 SGRPGEPGLMGPRGFPGSPGNIGPAGKEGPVGLPGIDGRPGPIGPAGARGEPGNIGFPGPKGPSGDPGKAGEKGHAGLAG
         440       450       460       470       480       490       500       510

          50            60             70            80        90         100
MaSP2  AVGGYGP----GQQGPGGY-----GPGQQGPSGP----GSAAAAAAAAVGGYGPGQQG-PGGYG-PGQQGPSG----PG-
       : :. ::    : ::: :      : :.:::.::    :  . :..:. .:  ::..: :: .: ::  :  :    ::
gi|278 ARGAPGPDGNNGAQGPPGLQGVQGGKGEQGPAGPPGFQGLPGPAGTAGEAGK-PGERGIPGEFGLPGPAGARGERGPPGE
         520       530       540       550       560        570       580       590

         110       120                130        140       150       160            170
MaSP2  SAAAAAAAAVGGYGP-GQQGPGGY--------GPGQQGPSGP-GSAAAAAAAAVGGYGPGQQGPGGY-----GPGQQGPS
       :.::. .. .:. :: :  :: :         .::  ::::: :  .  .::.. : : :..:  :      .::..:
gi|278 SGAAGPTGPIGSRGPSGPPGPDGNKGEPGVVGAPGTAGPSGPSGLPGERGAAGIPG-GKGEKGETGLRGDIGSPGRDGAR
          600       610       620       630       640       650        660       670

               180            190         200          210               220
MaSP2  G-PGSAAAAAAAAVGG----YGP-GQQGPGG-YG-PGQQG---PSGPGSAAAAAAAA--------VGGYGP-GQQGP-GG
       : ::. .: . :...:     :: :  ::.:  : ::..:   :.::.. :. :.::         :  :: :..:: :
gi|278 GAPGAIGAPGPAGANGDRGEAGPAGPAGPAGPRGSPGERGEVGPAGPNGFAGPAGAAGQPGAKGERGTKGPKGENGPVGP
           680       690       700       710       720       730       740       750

     230        240       250       260            270       280       290
MaSP2  YGP-GQQGPSGPGSAAAAAAAAVGGYGPGQQG-PGGYG----PGQQGPSGPGSAAAAAAAAVGGYGP-GQQGP-------
        :: :  :::::..  . :..   :  ::  : ::. :    :: .: ::: .  . :.   :  :: :.:::
gi|278 TGPVGAAGPSGPNGPPGPAGSRGDGGPPGATGFPGAAGRTGPPGPSGISGPPGPPGPAGKE-GLRGPRGDQGPVGRSGET
           760       770       780       790       800       810        820       830

         300            310       320           330        340        350        360
MaSP2  GGYGP----GQQGPSG-PGSAAAAAAAAVGGYGPGQQG----PGGYG-PGQQGPSG-PGSAAAAAAAAVGGYGP-GQQGP
       :. ::    :..:::: ::.:.  ..       :: ::    ::  : ::..:  : ::     .:..::  :: :  ::
gi|278 GASGPPGFVGEKGPSGEPGTAGPPGT-------PGPQGLLGAPGFLGLPGSRGERGLPG-----VAGSVGEPGPLGIAGP
            840       850              860       870       880            890       900

           370           380       390       400          410       420       430
MaSP2  GGYGPGQQGPSG----PGSAAAAAAAAVGGYGPGQQGPGGYG--PGQQGPSG-PGSAAAAAAAAVGGYGPGQQGPGGYGP
           :: .:: :    ::  .: . :.  : .::..:: :    ::..:  : ::.:. ..::..    :: :::   ::
gi|278 ----PGARGPPGNVGNPGVNGAPGEAGRDG-NPGNDGPPGRDGQPGHKGERGYPGNAGPVGAAGA----PGPQGP--VGP
                  910       920        930       940       950       960

         440        450        460          470        480       490          500
MaSP2  -GQQGPSG-PGSAAAAA-AAAVGGYGP-GQQGP-GGYG-PGQQGPSG-PGSAAAAAAAAVGGYGP--GQQG-PGGYGP-G
        :..:  : :: :.:.. :.:::  :: : ::  :  : ::..:: : ::  .  .  .. : .   :.:: ::. :: :
gi|278 VGKHGNRGEPGPAGAVGPAGAVGPRGPSGPQGIRGDKGEPGDKGPRGLPGLKGHNGLQGLPGLAGHHGDQGAPGAVGPAG
     970       980       990      1000      1010      1020      1030      1040

         510       520
MaSP2  QQGPSGPGSAAAAAAAAVGS
        .::.::.. :
gi|278 PRGPAGPSGPAGKDGRIGQPGAVGPAGIRGSQGSQGPAGPPGPPGPPGPPGPSGGGYEFGFDGDFYRADQPRSPTSLRPK
    1050      1060      1070      1080      1090      1100      1110      1120

>>gi|736319|gid|160|glutenin [Triticum aestivum]                              (838 aa)
 initn: 290 init1: 290 opt: 638  Z-score: 448.9  bits: 93.4 E(): 6.5e-020
Smith-Waterman score: 664; 33.6% identity (46.2% similar) in 539 aa overlap (16-509:123-643)

                                        10        20        30        40        50
MaSP2                           MHHHHHHHHKLAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQ
                                               : ::: .:   : :::    ::..  .. .  : :  . :
gi|736 GSFYPGETTPPQQLQQRIFWGIPALLKRYYPSVTSPQQVSYYPGQASPQRPGQGQQ----PGQGQQSGQGQQGYYPTSPQ
             90       100       110       120       130           140       150

          60             70        80         90         100          110        120
MaSP2  GPGGYGPGQQG-----PSGPGSAAAAAAAAVGGY-GPGQQG--PGGYGPGQQGPSG---PGSAAAAAAAAVGGY-GPGQQ
        :: .   .::     :..: . .     : :   : ::::  ::   ::    :.   ::.    : .  :   : :::
gi|736 QPGQWQQPEQGQPGYYPTSPQQPGQLQQPAQGQQPGQGQQGRQPGQGQPGYYPTSSQLQPGQLQQPAQGQQGQQPGQGQQ
      160       170       180       190       200       210       220       230

              130         140       150          160       170         180       190
MaSP2  G--PG-GYGPGQQGPSG--PGSAAAAAAAAVGGY-GPGQQG--PGGYGPGQQGPSG--PGSAAAAAAAAVGGYGPGQQGP
       :  :: :  ::: : .:  ::..   . .  :   : ::::  : .   . ::  :  : :    . .  : :  . : :
gi|736 GQQPGQGQQPGQ-GQQGQQPGQGQQPGQGQQGQQLGQGQQGYYPTSLQQSGQGQPGYYPTSLQQLGQGQSGYYPTSPQQP
      240       250        260       270       280       290       300       310

            200        210       220        230       240       250         260
MaSP2  G-GYGPGQ-QGPSGPGSAAAAAAAAVGGYGPGQQGPG-GYGPGQQGPSGPGSAAAAAAAAVGGYGP--GQQGPGGYGPGQ
       : :  ::: : :.   .   .  .   : :   : :: :  :::  :.   ..   .. .  :: :  .::   .  :::
gi|736 GQGQQPGQLQQPAQGQQPEQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSSQQPTQSQQPGQ
       320       330       340       350       360       370       380       390

      270       280        290         300       310       320            330       340
MaSP2  QGPSGPGSAAAAAAAAVG-GYGPGQQGPGGY--GPGQQGPSGPGSAAAAAAAAVGGYGPGQ-----QGPGGYGPGQQGPS
        : .:   . .  :   : :  :::  :: :  .: :.: . ::   ..   .  :  :::     ::  :  :::
gi|736 -GQQGQQVGQGQQAQQPGQGQQPGQGQPGYYPTSPLQSGQGQPGYYLTSPQQSGQGQQPGQLQQSAQGQKGQQPGQGQQP
        400       410       420       430       440       450       460       470

              350       360       370       380       390       400         410
MaSP2  GPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGY--GPGQQGPSGPGSAAAAAA
       : :. .   . .  :  :::  :: : : .   :: :.  .          :::  :: :  .: : : . ::   ..
gi|736 GQGQQGQQPGQGQQGQQPGQGQPG-YYPTSPQQSGQGQQPGQWQQ------PGQGQPGYYPTSPLQPGQGQPGYDPTSPQ
        480       490       500        510       520             530       540

      420        430       440        450       460         470       480       490
MaSP2  AAVGGYGPGQ-QGPGGYGPGQQGPS-GPGSAAAAAAAAVGGYGP--GQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGY-G
           :  ::: : :.    :::: . . :. .   : .  :  :  ::::    : :::: . ::..   : .  :   :
gi|736 QPGQGQQPGQLQQPA---QGQQGQQLAQGQQGQQPAQVQQGQQPAQGQQGQQ-LGQGQQGQQ-PGQGQQPAQGQQGQQPG
     550       560          570       580       590        600        610       620

             500        510       520
MaSP2  PGQQG--PG-GYGPGQQGPSGPGSAAAAAAAAVGS
        ::::  :: :  :::  :
gi|736 QGQQGQQPGQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQWQQPGQGQPGYYLTSPLQLGQGQQGYYPTSLQQPGQG
          630       640       650       660       670       680       690       700

>>gi|288860106|gid|160|high molecular weight glutenin subunit 5 [Triticum ae  (848 aa)
 initn: 532 init1: 327 opt: 584  Z-score: 412.8  bits: 86.7 E(): 6.6e-018
Smith-Waterman score: 652; 32.8% identity (46.3% similar) in 549 aa overlap (16-513:112-649)

                                        10        20          30        40        50
MaSP2                           MHHHHHHHHKLAVGGYGPGQQGPG--GYGPGQQGPSGPGSAAAAAAAAVGGYGPG
                                               : :.   :   .: ::: .:. ::..   . .  : :  .
gi|288 QYEQQIVVPPKGGSFYPGETTPPQQLQQRIFWGIPALLKRYYPSVTCPQQVSYYPGQASPQRPGQGQQPGQGQQGYYPTS
              80        90       100       110       120       130       140       150

            60             70        80         90         100          110        120
MaSP2  QQGPGGYGPGQQG-----PSGPGSAAAAAAAAVGGY-GPGQQG--PGGYGPGQQGPSG---PGSAAAAAAAAVGGY-GPG
        : :: .   .::     :..: ...     : :   : ::::  ::   ::    :.   ::.    : .  :   : .
gi|288 PQQPGQWQQPEQGQPRYYPTSPQQSGQLQQPAQGQQPGQGQQGQQPGQGQPGYYPTSSQLQPGQLQQPAQGQQGQQPGQA
             160       170       180       190       200       210       220       230

                130         140       150          160       170         180       190
MaSP2  QQG--PG-GYGPGQQGPSG--PGSAAAAAAAAVGGY-GPGQQG--PGGYGPGQQGPSG--PGSAAAAAAAAVGGYGPGQQ
       :::  :: :  ::: : .:  ::..   . .  :   : ::::  : .   . ::  :  : :    . .  : :  . :
gi|288 QQGQQPGQGQQPGQ-GQQGQQPGQGQQPGQGQQGQQLGQGQQGYYPTSLQQSGQGQPGYYPTSLQQLGQGQSGYYPTSPQ
             240        250       260       270       280       290       300       310

              200        210       220        230       240       250         260
MaSP2  GPG-GYGPGQ-QGPSGPGSAAAAAAAAVGGYGPGQQGPG-GYGPGQQGPSGPGSAAAAAAAAVGGYGP--GQQGPGGYGP
        :: :  ::: : :.   . . .  .   : :   : :: :  :::  :.   ..   .. .  :: :  .::   .  :
gi|288 QPGQGQQPGQLQQPAQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYPTSSQQPTQSQQP
              320       330       340       350       360       370       380       390

        270       280        290         300       310       320            330
MaSP2  GQQGPSGPGSAAAAAAAAVG-GYGPGQQGPGGY--GPGQQGPSGPGSAAAAAAAAVGGYGPGQ-----QGPGGYGPGQQG
       :: : .:   . .  :   : :  :::  :: :  .: :.: . ::   ..   .  :  :::     ::  :  :::
gi|288 GQ-GQQGQQVGQGQQAQQPGQGQQPGQGQPGYYPTSPQQSGQGQPGYYLTSPQQSGQGQQPGQLQQSAQGQKGQQPGQGQ
               400       410       420       430       440       450       460

      340       350       360       370       380       390       400         410
MaSP2  PSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGY--GPGQQGPSGPGSAAAA
         : :. .   . .  :  :::  :: : : .   :: :.  .          :::  :: :  .: : : . ::   ..
gi|288 QPGQGQQGQQPGQGQQGQQPGQGQPG-YYPTSPQQSGQGQQPGQWQQ------PGQGQPGYYPTSPLQPGQGQPGYDPTS
     470       480       490        500       510             520       530       540

        420        430          440         450         460        470        480
MaSP2  AAAAVGGYGPGQ-QGPGGYGPGQQ---GPSG--PGSAAAAAAAAVGGYG--PGQQGPGGY-GPGQQGPS-GPGSAAAAAA
             :  ::: : :.    :::   : .:  :...  .   : :  :  :::   :   : :::: . : :. .   :
gi|288 PQQPGQGQQPGQLQQPAQGQQGQQLAQGQQGQQPAQVQQGQRPAQGQQGQQPGQGQQGQQLGQGQQGQQPGQGQQGQQPA
            550       560       570       580       590       600       610       620

        490       500       510         520
MaSP2  AAVGGYGPGQQGPGGYGPGQQGPSG--PGSAAAAAAAAVGS
        .  :  ::: :  :  ::: : .:  ::
gi|288 QGQQGQQPGQ-GQQGQQPGQ-GQQGQQPGQGQQPGQGQPWYYPTSPQESGQGQQPGQWQQPGQGQPGYYLTSPLQLGQGQ
            630        640        650       660       670       680       690       700

>>gi|21751|gid|160|high molecular weight glutenin subunit 10 [Triticum aesti  (648 aa)
 initn: 108 init1: 108 opt: 489  Z-score: 350.4  bits: 74.8 E(): 2e-014
Smith-Waterman score: 555; 31.6% identity (45.5% similar) in 538 aa overlap (13-522:124-618)

                                           10        20          30        40        50
MaSP2                              MHHHHHHHHKLAVGGYGPGQQGP--GGYGPGQQGPSGPGSAAAAAAAAVGGY
                                               : :: ::  .:  :.: ::: .:. ::.          :
gi|217 VARQYEQTVVPPKGGSFYPGETTPLQQLQQGIFWGTSSQTVQGYYPGVTSPRQGSYYPGQASPQQPGQ----------GQ
            90       100       110       120       130       140       150

               60           70        80        90           100       110       120
MaSP2  GPGQQGPGGYGPGQQGPSG---PGSAAAAAAAAVGGYGPGQQGPG----GYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQ
        ::.    : :     :..   ::..   . .  : :  . : ::    :: : .   .:  .  . .     :  :::
gi|217 QPGKWQEPGQGQQWYYPTSLQQPGQGQQIGKGQQGYYPTSLQQPGQGQQGYYPTSLQHTGQRQQPVQGQQPEQGQQPGQW
           160       170       180       190       200       210       220       230

             130        140       150       160       170       180       190       200
MaSP2  GPGGY--GPGQQGPSG-PGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQ
         : :  .: : : .  : .   .. .  : :  . : ::    ::::    .:    . .  : :  .:: ::    ::
gi|217 QQGYYPTSPQQLGQGQQPRQWQQSGQGQQGHYPTSLQQPG---QGQQGHYL-ASQQQPGQGQQGHYPASQQQPG---QGQ
           240       250       260       270          280        290       300

              210       220       230       240       250       260       270       280
MaSP2  QGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAA
       ::   :.:    . .  : :  .:: ::    :::: . :.:    . .  : :  . : ::    ::::   : :
gi|217 QG-HYPASQQQPGQGQQGHYPASQQEPG---QGQQG-QIPASQQQPGQGQQGHYPASLQQPG---QGQQGHY-PTSLQQL
         310       320       330           340       350       360          370

              290        300         310       320       330          340       350
MaSP2  AAAAVGGYGPGQ-QGPGGYGPGQQGPSG--PGSAAAAAAAAVGGYGPGQQGPG-GY--GPGQQGPSGPGSAAAAAAAAVG
       . .   :  ::: : :   : :::  .:  : .    . .  : :  . : :: :   : ::::   : :    . .  :
gi|217 GQGQQTGQ-PGQKQQP---GQGQQTGQGQQPEQEQQPGQGQQGYYPTSLQQPGQGQQQGQGQQGYY-PTSLQQPGQGQQG
       380        390          400       410       420       430        440       450

          360       370       380       390       400       410       420       430
MaSP2  GYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPG-G
        :  . : ::   :::.   : :.          :  ::: :  :: :   .:. ::..   . .  : :  . : :: :
gi|217 HYPASLQQPGQGQPGQRQQPGQGQHPEQ------GKQPGQ-GQQGYYP--TSPQQPGQGQQLGQGQQGYYPTSPQQPGQG
            460       470       480              490         500       510       520

             440         450            460       470       480       490       500
MaSP2  YGPGQ--QG--PSGPGSAAAAAAAAVGGY-G----PGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGP
         :::  ::  :..: ... :   . :   :    ::: :  :: : .    : :. .. .  .  :. ::: :  . :
gi|217 QQPGQGQQGHCPTSPQQSGQAQQPGQGQQIGQVQQPGQ-GQQGYYPTSVQQPGQGQQSGQGQQSGQGHQPGQ-GQQS-GQ
           530       540       550       560        570       580       590         600

          510       520
MaSP2  GQQGPSGPGSAAAAAAAAVGS
        ::: ..:  ..:   ::
gi|217 EQQGYDSPYHVSAEQQAASPMVAKAQQPATQLPTVCRMEGGDALSASQ
              610       620       630       640

>>gi|21779|gid|160|HMW glutenin-like protein product [Triticum aestivum]      (660 aa)
 initn: 126 init1: 126 opt: 480  Z-score: 344.4  bits: 73.7 E(): 4.3e-014
Smith-Waterman score: 573; 32.4% identity (45.8% similar) in 555 aa overlap (13-522:124-630)

                                           10        20          30        40        50
MaSP2                              MHHHHHHHHKLAVGGYGPGQQGP--GGYGPGQQGPSGPGSAAAAAAAAVGGY
                                               : :: :.  .:  :.: ::: .:. ::.          :
gi|217 VARQYEQTVVPPKGGSFYPGETTPLQQLQQGIFWGTSSQTVQGYYPSVTSPRQGSYYPGQASPQQPGQ----------GQ
            90       100       110       120       130       140       150

               60           70        80        90          100         110        120
MaSP2  GPGQQGPGGYGPGQQGPSG---PGSAAAAAAAAVGGYGPGQQGPGG---YGPGQQG--PSGPGSAAAAAAAAVGGY-GPG
        ::.    : :     :..   ::..   . .  : :  . : ::     : ::::  :..:  ..     . :   : :
gi|217 QPGKWQEPGQGQQWYYPTSLQQPGQGQQIGKGKQGYYPTSLQQPGQGQQIGQGQQGYYPTSPQHTGQRQQPVQGQQIGQG
           160       170       180       190       200       210       220       230

             130           140       150       160          170       180       190
MaSP2  QQGPGGYGPGQ--QG--PSGPGSAAAAAAAAVGGYGPGQQGPGGY---GPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPG
       ::   :  :::  ::  :..: .            : ::: :: .   : ::::   : :    . .  : :  .:: :.
gi|217 QQPEQGQQPGQWQQGYYPTSPQQ-----------LGQGQQ-PGQWQQSGQGQQG-HYPTSLQQPGQGQQGHYLASQQQPA
           240       250                  260        270        280       290       300

          200       210       220           230       240       250       260
MaSP2  GYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPG----GYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQG--PGGYG-PG
           ::::   :.:    . .  : :  .:: ::    :. :..:   : :. .   :.     : ::::  :..   ::
gi|217 ---QGQQG-HYPASQQQPGQGQQGHYPASQQQPGQGQQGHYPASQQEPGQGQQGQIPASQ-QQPGQGQQGHYPASLQQPG
                  310       320       330       340       350        360       370

       270       280       290        300         310       320       330          340
MaSP2  QQGPSGPGSAAAAAAAAVGGYGPGQ-QGPGGYGPGQQGPSG--PGSAAAAAAAAVGGYGPGQQGPG-GY--GPGQQGPSG
       :::    .    . .  .:   ::: : :   : :::  .:  : .    . .  : :  . : :: :   : ::::
gi|217 QQGHYPTSLQQLGQGQQIG--QPGQKQQP---GQGQQTGQGQQPEQEQQPGQGQQGYYPTSLQQPGQGQQQGQGQQGYY-
         380       390         400          410       420       430       440

             350       360         370       380       390       400       410
MaSP2  PGSAAAAAAAAVGGYGPGQQGPG-GYG-PGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAA
       : :    . .  : :  . : :: : : :::.   : :.          :  ::: :  :: :   .:. ::..   . .
gi|217 PTSLQQPGQGQQGHYPASLQQPGQGQGQPGQRQQPGQGQHPEQ------GQQPGQ-GQQGYYP--TSPQQPGQGQQLGQG
     450       460       470       480       490              500         510       520

     420       430        440         450       460              470       480
MaSP2  AVGGYGPGQQGPG-GYGPGQQGPSG--PGSAAAAAAAAVGGYGP--GQ-QGPG----GYGPGQQGPSGPGSAAAAAAAAV
         : :  . : :: :  ::: : .:  : :   .. :   : :   :: : ::    :: : .    : :. .. .  .
gi|217 QQGYYPTSPQQPGQGQQPGQ-GQQGHCPMSPQQTGQAQQLGQGQQIGQVQQPGQGQQGYYPTSLQQPGQGQQSGQGQQSG
              530       540        550       560       570       580       590

     490       500         510       520
MaSP2  GGYGPGQQGPGGYGPGQ--QGPSGPGSAAAAAAAAVGS
        :. :::    :   ::  :: ..:  ..:   ::
gi|217 QGHQPGQ----GQQSGQEKQGYDSPYHVSAEQQAASPMVAKAQQPATQLPTVCRMEGGDALSASQ
     600           610       620       630       640       650       660

>>gi|71084277|gid|160|HMW glutenin x-type subunit Bx7 precursor [Triticum ae  (795 aa)
 initn: 151 init1: 151 opt: 471  Z-score: 337.7  bits: 72.7 E(): 1e-013
Smith-Waterman score: 605; 32.6% identity (49.1% similar) in 601 aa overlap (15-522:174-765)

                                         10        20           30        40         50
MaSP2                            MHHHHHHHHKLAVGGYGPGQQGPGGY---GPGQQGPSGPGSAAAAAAAAVG-GY
                                               :: : .: ::     : :::. .:  .   ..    : :
gi|710 QEQQPGQGQQDQQPGQRQQGYYPTSPQQPGQGQQLGQGQPGYYPTSQQPGQKQQAGQGQQSGQGQQGYYPTSPQQSGQGQ
           140       150       160       170       180       190       200       210

               60           70        80        90          100            110
MaSP2  GPGQQGPGGY--GPGQQGP-SGPGSAAAAAAAAVGGYGPGQQGPG-GYGPGQ--QG-----PSGPGSAAAAAAAAVGGYG
        :::  :: :  .: :.:  . ::..   . .  .: :   : :: :  :::  ::     :. ::..  .. .  : :
gi|710 QPGQGQPGYYPTSPQQSGQWQQPGQGQQPGQGQQSGQGQQGQQPGQGQRPGQGQQGYYPISPQQPGQGQQSGQGQPGYYP
           220       230       240       250       260       270       280       290

          120         130         140       150                 160        170
MaSP2  -----PGQ-QGPG-GYGPGQQGPSG--PGSAAAAAAAAVGGYG-----PGQ-----QGPGGYGP-GQQGPSG--PGSAAA
            ::: : :: :  ::: : .:  ::..  .. .  : :      :::     ::  :: : .::. .:  ::..
gi|710 TSLRQPGQWQQPGQGQQPGQ-GQQGQQPGQGQQSGQGQQGYYPTSLQQPGQGQQLGQGQPGYYPTSQQSEQGQQPGQGKQ
           300       310        320       330       340       350       360       370

       180         190        200         210         220         230         240
MaSP2  AAAAAVGGY--GPGQQGPGGY-GPGQQG--PSGPGSAAAAAAAAVG--GYGPG--QQGPGGYGPGQQGPSG--PGSAAAA
        . .  : :  .: :.: :   : :: :  :..: ... .  .. :  :: :   ::.  :  ::: : ::  : :   .
gi|710 PGQGQQGYYPTSPQQSGQGQQLGQGQPGYYPTSPQQSGQGQQSGQGQQGYYPTSPQQSGQGQQPGQ-GQSGYFPTSRQQS
            380       390       400       410       420       430        440       450

        250        260        270       280         290            300       310
MaSP2  AAAAVGGYGPGQ-QGPGGYGPGQ-QGPSGPGSAAAAAAAAVGGYG--PGQQGPGGYG-----PGQQGPSGPGSAAAA---
       . .   : :  . ::  :  ::: :    : :.  .     .:    :::  :: :      :::.  :: .. ..
gi|710 GQGQQPGQGQQSGQGQQGQQPGQGQQAYYPTSSQQSRQRQQAGQWQRPGQGQPGYYPTSPQQPGQEQQSGQAQQSGQWQL
             460       470       480       490       500       510       520       530

          320        330          340         350       360       370         380
MaSP2  AAAAVGGYGPGQ-QGPG-GYGP--GQQGPS--GPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSG--PGSAAAAAAAA
       .   ..   ::: : :. :  :  :::. .   ::.:  ..   .  :  . : ::      :: .:  : :   .. .
gi|710 VYYPTSPQQPGQLQQPAQGQQPAQGQQSAQEQQPGQAQQSGQWQLVYYPTSPQQPGQLQQPAQGQQGYYPTSPQQSGQGQ
             540       550       560       570       580       590       600       610

        390           400           410       420         430        440         450
MaSP2  VGGY--GPGQQGPG--GYGPG--QQGPSG--PGSAAAAAAAAVGGY--GPGQQGPGGY-GPGQQG--PSGPGSAAAAAAA
        : :  .: :.: :  :: :   ::. .:  ::..     .  : :  .: :.: :   : ::::  :..: ...
gi|710 QGYYPTSPQQSGQGQQGYYPTSPQQSGQGQQPGQGQQPRQGQQGYYPISPQQSGQGQQPGQGQQGYYPTSPQQSGQ----
             620       630       640       650       660       670       680

           460           470         480       490            500        510       520
MaSP2  AVGGYGPG-QQGPGGY---GPGQQG--PSGPGSAAAAAAAAVG--GYGPG---QQGPGGY-GPGQQGPSGPGSAAAAAAA
          :  :: .: :: .   : ::::  :..  ... .  .. :  :: :    : : :   : :::: ..:  ..:   :
gi|710 ---GQQPGHEQQPGQWLQPGQGQQGYYPTSSQQSGQGHQSGQGQQGYYPTSLWQPGQGQQPGQGQQGYASPYHVSAEYQA
          690       700       710       720       730       740       750       760


MaSP2  AVGS
       :
gi|710 ARLKVAKAQQLAAQLPAMCRLEGSDALSTRQ
          770       780       790

>>gi|21743|gid|160|high molecular weight glutenin subunit 1Ax1 [Triticum aes  (830 aa)
 initn: 172 init1: 172 opt: 459  Z-score: 329.6  bits: 71.3 E(): 2.9e-013
Smith-Waterman score: 550; 32.0% identity (44.3% similar) in 575 aa overlap (17-522:258-800)

                                       10        20           30        40        50
MaSP2                          MHHHHHHHHKLAVGGYGPGQQ-GPGGYG--PGQQGPSGPGSAAAAAAAAVGGYGPG
                                               : ::: : :  :  :::.  :: :. .    .     : :
gi|217 SGQGQPRYYPTSSQQPGQLQQLAQGQQGQQPERGQQGQQSGQGQQLGQGQQGQQPGQKQQSGQGQQGYYPISP-QQLGQG
       220       230       240       250       260       270       280       290

               60        70        80          90               100       110
MaSP2  QQ-GPG--GYGPGQQGPSGPGSAAAAAAAAVGGYGPGQ--QGPGGYGPGQ--------QGPSGPGSAAAAAAAAVG-GYG
       :: : :  :: : .   :: :...   ..:     :::  :.      ::        :: .:  :.        : :
gi|217 QQSGQGQLGYYPTSPQQSGQGQSGYYPTSA---QQPGQLQQSTQEQQLGQEQQDQQSGQGRQGQQSGQRQQDQQSGQGQQ
        300       310       320          330       340       350       360       370

     120         130       140       150            160         170       180
MaSP2  PGQQGPGGYG--PGQQGPSGPGSAAAAAAAAVGGYGPGQ-----QGPGGYGP--GQQGPSGPGSAAAAAAAAVGGYG--P
       :::. :: :.  : : : . :    ..         : :     ::  :  :  :::: . ::..  .   . :  :  :
gi|217 PGQRQPGYYSTSPQQLGQGQPRYYPTSPQQPGQEQQPRQLQQPEQGQQGQQPEQGQQGQQ-PGQGEQGQQPGQGQQGQQP
           380       390       400       410       420       430        440       450

      190              200         210       220       230        240       250
MaSP2  GQQGPGGY-------GPGQQG--PSGPGSAAAAAAAAVGGYGPGQQGPGGY-GPGQQGPSGPGSAAAAAAAAVGGYGPGQ
       ::  :: :       : :: :  :..: ...     :  :  :::.  :   : :::: . ::.          :  :::
gi|217 GQGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQLQQPA-QGQQPGQEQQGQQPGQGQQGQQ-PGQ----------GQQPGQ
            460       470       480        490       500       510                  520

      260         270                280       290         300        310       320
MaSP2  QGPGGY--GPGQQGP---------SGPGSAAAAAAAAVGGYGPGQQGPGGY--GPGQQGPSG-PGSAAAAAAAAVGGY-G
         :: :  .: :.:          :: :. .   .. .    :::  :: :  .: : : .  ::.    . .  :   :
gi|217 GQPGYYPTSPQQSGQEQQLEQWQQSGQGQPGHYPTSPL---QPGQGQPGYYPTSPQQIGQGQQPGQLQQPTQGQQGQQPG
              530       540       550          560       570       580       590

           330       340       350              360        370        380       390
MaSP2  PGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGY-------GPGQQGPGGYG-PGQQGPSG-PGSAAAAAAAAVGGYGPGQ
        ::::    : :::: . ::..   . .  : :       : ::: :: .  :::  :.  : :.     .  : :  .:
gi|217 QGQQGQQP-GQGQQGQQ-PGQGQQPGQGQPGYYPTSLQQSGQGQQ-PGQWQQPGQGLPGYYPTSSLQPEQGQQGYYPTSQ
       600        610        620       630       640        650       660       670

           400       410       420         430       440       450       460       470
MaSP2  QGPG-GYGPGQQGPSGPGSAAAAAAAAVGGYGPG--QQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGP-
       : :: :  :::   :: :.          :: :   ::.  :  :::    :    ..   ..    : :::      :
gi|217 QQPGQGPQPGQWQQSGQGQQ---------GYYPTSPQQSGQGQQPGQWLQPGQWLQSGYYLTSPQQLGQGQQPRQWLQPR
          680       690                700       710       720       730       740

                 480       490       500       510       520
MaSP2  -GQQG--PSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGS
        ::::  :..: ... .   . :  :    .:   : :::: ..:  ..:   ::
gi|217 QGQQGYYPTSPQQSGQGQQLGQGQQGYYPTSPQQSGQGQQGYDSPYHVSAEHQAASLKVAKAQQLAAQLPAMCRLEGGDA
         750       760       770       780       790       800       810       820

gi|217 LLASQ
         830

>>gi|22090|gid|160|HMW glutenin subunit 1By9 [Triticum aestivum]              (705 aa)
 initn: 104 init1: 104 opt: 457  Z-score: 328.8  bits: 70.9 E(): 3.1e-013
Smith-Waterman score: 576; 32.6% identity (47.5% similar) in 564 aa overlap (13-512:124-665)

                                           10            20        30        40
MaSP2                              MHHHHHHHHKLAVGGYGPG----QQGPGGYGPGQQGPSGPGSAAAAAAAAVG
                                               : :: :.    ::::  : ::: .:. ::..   .
gi|220 VVRQYEQTVVPPKGGSFYPGETTPLQQLQQVIFWGTSSQTVQGYYPSVSSPQQGP--YYPGQASPQQPGQGQQPGKWQ--
            90       100       110       120       130         140       150

       50          60                70        80        90          100         110
MaSP2  GYGPGQQG--PGGY---GPGQQG--PSG---PGSAAAAAAAAVGGYGPGQQGPGG---YGPGQQG--PSGPGSAAAAAAA
         : ::::  : .    : ::::  ::.   ::..   . .  : :  . : ::     : ::::  :..:   .
gi|220 ELGQGQQGYYPTSLHQSGQGQQGYYPSSLQQPGQGQQIGQGQQGYYPTSLQQPGQGQQIGQGQQGYYPTSPQHPGQRQQP
     160       170       180       190       200       210       220       230

            120         130       140       150             160       170       180
MaSP2  AVGGY-GPGQQ-GPG-GYGPGQQGPSGPGSAAAAAAAAVG-GYGPGQ-----QGPGGYGPGQQGPSGPGSAAAAAAAAVG
       . :   : ::: : :   : :::. .:  .   ..   .: :  :::     ::  :: : .:   : :. .   :.
gi|220 GQGQQIGQGQQLGQGRQIGQGQQSGQGQQGYYPTSPQQLGQGQQPGQWQQSGQGQQGYYPTSQQQPGQGQQGQYPASQ-Q
     240       250       260       270       280       290       300       310

          190            200       210       220       230       240       250
MaSP2  GYGPGQQG--PGGY---GPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQ
         : ::::  :..    : ::::   :.:    . .  : :  .:: :   : :::    :.:    . .  : :  . :
gi|220 QPGQGQQGQYPASQQQPGQGQQGQY-PASQQQPGQGQQGHYLASQQQP---GQGQQRHY-PASLQQPGQGQQGHYTASLQ
      320       330       340        350       360          370        380       390

     260       270       280       290         300         310         320       330
MaSP2  GPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQ-GPGGYG-PGQQGPSG--PGSAAAAAAAAV--GGYGPGQQGPGGYG
        ::    ::::    .   .. .  .:  :  :: : :     :::  .:  ::..  .  .     :  ::: :  ::
gi|220 QPG---QGQQGHYPASLQQVGQGQQIGQLGQRQQPGQGQQTRQGQQLEQGQQPGQGQQTRQGQQLEQGQQPGQ-GQQGYY
              400       410       420       430       440       450       460

             340         350       360              370         380       390       400
MaSP2  PG--QQGPSG--PGSAAAAAAAAVGGYGPGQQGPG----GYGPG---QQGPSGPGS--AAAAAAAAVGGYGPGQQGPGGY
       :   ::. .:  ::..   . .  : :. . : ::    :. :.   : : . ::.    . .     :  ::: :  ::
gi|220 PTSPQQSGQGQQPGQSQQPGQGQQGYYSSSLQQPGQGLQGHYPASLQQPGQGHPGQRQQPGQGQQPEQGQQPGQ-GQQGY
     470       480       490       500       510       520       530       540

              410       420       430          440         450            460       470
MaSP2  GPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPG-GYGPGQ--QG--PSGPGSAAAAAAAAVGGY-G----PGQQGPGGYGP
        :   .:. ::..   . .  : :  . : :: :  :::  ::  :..: ... :   . :   :    ::: :  :: :
gi|220 YP--TSPQQPGQGKQLGQGQQGYYPTSPQQPGQGQQPGQGQQGHCPTSPQQTGQAQQPGQGQQIGQVQQPGQ-GQQGYYP
      550         560       570       580       590       600       610        620

              480       490         500       510       520
MaSP2  GQQGPSGPGSAAAAAAAAVGGY--GPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGS
        .   :: :. .. .  .  :.  : :::.    :  ::: ..:
gi|220 ISLQQSGQGQQSGQGQQSGQGHQLGQGQQS----GQEQQGYDNPYHVNTEQQTASPKVAKVQQPATQLPIMCRMEGGDAL
         630       640       650           660       670       680       690       700

gi|220 SASQ


>>gi|508732625|gid|160|high molecular weight glutenin subunit, partial [Trit  (794 aa)
 initn: 153 init1: 153 opt: 426  Z-score: 307.7  bits: 67.2 E(): 4.7e-012
Smith-Waterman score: 538; 31.9% identity (43.1% similar) in 562 aa overlap (17-522:231-764)

                                       10        20           30        40        50
MaSP2                          MHHHHHHHHKLAVGGYGPGQQ-GPGGYG--PGQQGPSGPGSAAAAAAAAVGGYGPG
                                               : ::: : :  :  :::.  :: :. .    .     : :
gi|508 SGQGQPRYYPTSSQQPGQLQQLAQGQQGQQPERGQQGQQSGQGQQLGQGQQGQQPGQKQQSGQGQQGYYPISP-QQLGQG
              200       210       220       230       240       250       260

               60        70        80          90               100       110
MaSP2  QQ-GPG--GYGPGQQGPSGPGSAAAAAAAAVGGYGPGQ--QGPGGYGPGQ--------QGPSGPGSAAAAAAAAVG-GYG
       :: : :  :: : .   :: :...   ..:     :::  :.      ::        :: .:  :.        : :
gi|508 QQSGQGQLGYYPTSPQQSGQGQSGYYPTSA---QQPGQLQQSTQEQQLGQEQQDQQSGQGRQGQQSGQRQQDQQSGQGQQ
     270       280       290          300       310       320       330       340

     120         130       140       150            160         170        180
MaSP2  PGQQGPGGYG--PGQQGPSGPGSAAAAAAAAVGGYGPGQ-----QGPGGYGP--GQQGPSG-PGSAAAAAAAAVGGYGPG
       :::. :: :.  : : : . :    ..         : :     ::  :  :  :::: .   :  .   . .  :  ::
gi|508 PGQRQPGYYSTSPQQLGQGQPRYYPTSPQQPGQEQQPRQLQQPEQGQQGQQPEQGQQGQQQRQGEQGQQPGQGQQGQQPG
        350       360       370       380       390       400       410       420

     190              200         210       220            230           240        250
MaSP2  QQGPGGY-------GPGQQG--PSGPGSAAAAAAAAVGGYGPGQ--QG--PG-GYGPGQ--QG--PSGPG-SAAAAAAAA
       :  :: :       : :: :  :..: ...     :  :  :::  ::  :: :  :::   :  :..:  :.
gi|508 QGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQLQQPA-QGQQPGQEQQGQQPGQGQQPGQGQPGYYPTSPQQSGQEQQLEQ
        430       440       450       460        470       480       490       500

              260       270       280       290        300       310       320
MaSP2  VGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQ-QGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGP
           : :: :    .: : : . ::   ..      :  ::: : :     :::: . ::..  .        : ::::
gi|508 WQQSGQGQPGHYPTSPLQPGQGQPGYYPTSPQQIGQGQQPGQLQQPT---QGQQGQQ-PGQGQQGQQP-----GEGQQGQ
         510       520       530       540       550           560            570

     330        340       350       360        370       380       390        400
MaSP2  GGYGPGQQ-GPSGPGSAAAAAAAAVGGYGPGQ-QGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPG-GYGPGQQG
          : ::: : . ::   ..   .  :  ::: : ::   ::      : :.     .  : :  .:: :: :  :::
gi|508 QP-GQGQQPGQGQPGYYPTSLQQSGQGQQPGQWQQPGQGQPGYY----PTSSLQPEQGQQGYYPTSQQQPGQGPQPGQWQ
         580       590       600       610           620       630       640       650

        410       420         430       440       450       460       470           480
MaSP2  PSGPGSAAAAAAAAVGGYGPG--QQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGP--GQQG--PSGPGS
        :: :.          :: :   ::.  :  :::    :    ..   ..    : :::      :  ::::  :..: .
gi|508 QSGQGQQ---------GYYPTSPQQSGQGQQPGQWLQPGQWLQSGYYLTSPQQLGQGQQPRQWLQPRQGQQGYYPTSPQQ
                      660       670       680       690       700       710       720

              490       500       510       520
MaSP2  AAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGS
       .. .   . :  :    .:   : :::: ..:  ..:   ::
gi|508 SGQGQQLGQGQQGYYPTSPQQSGQGQQGYDSPYHVSAEHQAASLKVAKAQQLAAQLPAMCRLEGGDALSASQ
            730       740       750       760       770       780       790

>>gi|170743|gid|160|HMW glutenin subunit Ax2* [Triticum aestivum]             (815 aa)
 initn: 153 init1: 153 opt: 426  Z-score: 307.6  bits: 67.2 E(): 4.8e-012
Smith-Waterman score: 538; 31.9% identity (43.1% similar) in 562 aa overlap (17-522:252-785)

                                       10        20           30        40        50
MaSP2                          MHHHHHHHHKLAVGGYGPGQQ-GPGGYG--PGQQGPSGPGSAAAAAAAAVGGYGPG
                                               : ::: : :  :  :::.  :: :. .    .     : :
gi|170 SGQGQPRYYPTSSQQPGQLQQLAQGQQGQQPERGQQGQQSGQGQQLGQGQQGQQPGQKQQSGQGQQGYYPISP-QQLGQG
             220       230       240       250       260       270       280        290

               60        70        80          90               100       110
MaSP2  QQ-GPG--GYGPGQQGPSGPGSAAAAAAAAVGGYGPGQ--QGPGGYGPGQ--------QGPSGPGSAAAAAAAAVG-GYG
       :: : :  :: : .   :: :...   ..:     :::  :.      ::        :: .:  :.        : :
gi|170 QQSGQGQLGYYPTSPQQSGQGQSGYYPTSA---QQPGQLQQSTQEQQLGQEQQDQQSGQGRQGQQSGQRQQDQQSGQGQQ
              300       310       320          330       340       350       360

     120         130       140       150            160         170        180
MaSP2  PGQQGPGGYG--PGQQGPSGPGSAAAAAAAAVGGYGPGQ-----QGPGGYGP--GQQGPSG-PGSAAAAAAAAVGGYGPG
       :::. :: :.  : : : . :    ..         : :     ::  :  :  :::: .   :  .   . .  :  ::
gi|170 PGQRQPGYYSTSPQQLGQGQPRYYPTSPQQPGQEQQPRQLQQPEQGQQGQQPEQGQQGQQQRQGEQGQQPGQGQQGQQPG
       370       380       390       400       410       420       430       440

     190              200         210       220            230           240        250
MaSP2  QQGPGGY-------GPGQQG--PSGPGSAAAAAAAAVGGYGPGQ--QG--PG-GYGPGQ--QG--PSGPG-SAAAAAAAA
       :  :: :       : :: :  :..: ...     :  :  :::  ::  :: :  :::   :  :..:  :.
gi|170 QGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQLQQPA-QGQQPGQEQQGQQPGQGQQPGQGQPGYYPTSPQQSGQEQQLEQ
       450       460       470       480        490       500       510       520

              260       270       280       290        300       310       320
MaSP2  VGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQ-QGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGP
           : :: :    .: : : . ::   ..      :  ::: : :     :::: . ::..  .        : ::::
gi|170 WQQSGQGQPGHYPTSPLQPGQGQPGYYPTSPQQIGQGQQPGQLQQPT---QGQQGQQ-PGQGQQGQQP-----GEGQQGQ
        530       540       550       560       570          580        590

     330        340       350       360        370       380       390        400
MaSP2  GGYGPGQQ-GPSGPGSAAAAAAAAVGGYGPGQ-QGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPG-GYGPGQQG
          : ::: : . ::   ..   .  :  ::: : ::   ::      : :.     .  : :  .:: :: :  :::
gi|170 QP-GQGQQPGQGQPGYYPTSLQQSGQGQQPGQWQQPGQGQPGYY----PTSSLQPEQGQQGYYPTSQQQPGQGPQPGQWQ
        600       610       620       630       640           650       660       670

        410       420         430       440       450       460       470           480
MaSP2  PSGPGSAAAAAAAAVGGYGPG--QQGPGGYGPGQQGPSGPGSAAAAAAAAVGGYGPGQQGPGGYGP--GQQG--PSGPGS
        :: :.          :: :   ::.  :  :::    :    ..   ..    : :::      :  ::::  :..: .
gi|170 QSGQGQQ---------GYYPTSPQQSGQGQQPGQWLQPGQWLQSGYYLTSPQQLGQGQQPRQWLQPRQGQQGYYPTSPQQ
                     680       690       700       710       720       730       740

              490       500       510       520
MaSP2  AAAAAAAAVGGYGPGQQGPGGYGPGQQGPSGPGSAAAAAAAAVGS
       .. .   . :  :    .:   : :::: ..:  ..:   ::
gi|170 SGQGQQLGQGQQGYYPTSPQQSGQGQQGYDSPYHVSAEHQAASLKVAKAQQLAAQLPAMCRLEGGDALLASQ
           750       760       770       780       790       800       810



525 residues in 1 query   sequences
443551 residues in 1897 library sequences
 Scomplib [35.04]
 start: Wed Sep 16 15:00:15 2015 done: Wed Sep 16 15:00:15 2015
 Total Scan time:  0.000 Total Display time:  0.000

Function used was FASTA [version 35.04 Jan. 15, 2009]